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I.  INTRODUCTION 


This  document  constitutes  MATHEMATICAL  Final 
Report  on  project  work  undertaken  for  the  Office  of  Naval 
Research  under  contract  N00014-66-C-0215.  The  central 
aim  of  the  research  undertaken  in  this  project  was  to  inves¬ 
tigate  the  application  of  mathematical  game  theory  to  ASW 
detection  problems. 

The  major  findings,  conclusions,  and  recommenda¬ 
tions  which  have  resulted  from  this  project  are  described  in 
Chapter  H,  "Game -Theoretic  Models  for  ASW",  in  non¬ 
technical  language.  Chapter  HI,  "Game-Theoretic  Analyses 
of  ASW  Problems",  consists  of  a  series  of  technical  papers 
examining  various  facets  of  ASW  problems  and  models. 

The  work  on  this  project  was  performed  by  the  follow¬ 
ing  members  of  MATHEMATICAL  staff:  Norman  I.  Agin, 
Michel  L.  Balinski,  Harold  W.  Kuhn,  John  P.  Mayberry, 


and  Francis  M.  Sand. 


II.  1  Construction  of  Game -Theoretic  Models 


The  basic  aim  in  the  research  undertaken  thus  far 
has  been  to  introduce  strategic  choices  into  the  problem  of 
submarine -submarine  detection  and  to  introduce  them  in  a 
way  that  can  be  analyzed  by  the  mathematics  of  game  theory. 
This  research  has  consisted  of  a  sequence  of  models  which 
have  incorporated  successively  more  strategic  features  and 
more  parameters  designed  to  better  represent  the  real 
situation. 

To  understand  the  basic  structure  of  these  models,  it 
is  useful  to  recall  the  underlying  theory  in  which  they  are  set, 
namely,  zero-sum  two-person  game  theory.  This  theory 
of  conflict  deals  with  situations  in  which  two  opposing  parties 
make  strategic  choices  that  control  their  actions  throughout 
a  particular  contest.  The  rules  of  the  game  define  precisely 
the  strategies  available  to  each  of  the  opposing  parties,  and 
determine  the  outcome  for  each  player,  when  a  specific  stra¬ 
tegy  is  chosen  by  each  party.  This  outcome  is  measured  by 
a  numerical  payoff  to  each  player  as  a  result  of  the  contest 
and  which  is  a  function  of  the  strategies  chosen  by  the  two 
parties  to  the  conflict.  If  there  are  but  two  parties  to  the 
conflict  and  if  what  one  wins  the  other  loses,  then  the  game 
is  called  zero-sum  two -person  and  assumes  a  particularly 


simple  formal  structure.  Namely,  the  game  may  be  described 
by  the  sets  X  and  Y  of  strategies  available  respectively  to 
the  first  player  and  to  the  second  player,  and  a  real-valued 
function  f(x,  y)  defined  for  every  choice  of  x  t  X  and 
y  £  Y  .  (Conventionally,  f  is  taken  to  be  the  payoff  paid  to 
the  first  player  by  the  second  player.) 

In  various  applications  the  strategy  sets  X  and  Y 
take  on  different  forms.  For  instance,  they  may  be  chosen  to 
be  finite  discrete  sets  --  in  which  :ase,  the  payoff  function 
is  a  matrix  with  real  entries.  In  other  cases,  they  may  be 
taken  to  be  all  probability  mixtures  of  a  finite  set  of  distinct 
elements  --  this  is  the  familiar  case  of  "mixed  strategies" 
for  finite  zero-sum  two-person  games.  In  other  instances, 
the  sets  X  and  Y  may  take  a  structure  dictated  by  the 
essential  features  of  the  context  under  study.  For  example, 
if  the  second  player  is  a  transiting  submarine  crossing  a 
rectangular  barrier,  Y  may  consist  of  all  possible  speeds 
at  which  he  may  travel,  if  we  believe  that  this  is  the  only 
relevant  parameter  in  the  problem. 

The  structure  of  the  models  studied  in  our  research 
may  now  be  explained  within  this  framework.  First,  and  very 
important  in  its  consequences,  all  of  our  analyses  have  been 
carried  out  with  the  payoff  function  giving  the  probability  of 
first  detection.  (Here  and  throughout  the  reports  the  first 


-4- 


player  is  a  submarine  patrolling  the  barrier  and  the  second 


player  is  a  submarine  transiting  the  barrier.)  Whatever  the 
strategy  spaces  X  and  Y  used  for  the  patroller  and  the 
transitor,  respectively,  the  first  step  in  fully  defining  the 
game  has  been  to  compute  the  probability  of  first  detection 
if  the  patroller  chooses  x  i  X  and  the  transitor  chooses 

y  e  y. 

A  second  important  feature  of  the  analysis  is  the 
choice  of  a  solution  concept  for  the  games  after  they  are  de¬ 
fined.  Two  distinct  approaches  have  been  adopted.  The 
first  is  a  technique  which  has  been  used  in  situations  which 
are  either  too  complicated  to  permit  a  solution  with  strategic 
choices  on  both  sides  or  have  a  "reasonable"  fixed  strategy 
for  one  of  the  players.  (An  example  of  the  latter  is  provided 
by  a  "house"  strategy  for  card  games  such  as  Blackjack. ) 
Precisely,  if  we  fix  the  strategy  of  the  second  player  to  be 
y  t  Y  ,  the  problem  of  the  first  player  becomes  simply: 

Find  x  =  x  so  as  to  maximize  f(x,  y)  .  If  we  were  certain 
that  the  second  player  would  use  y  then  the  first  player 
surely  can  do  no  better  than  ploy  x  . 

The  second  solution  concept  which  has  been  used  ex¬ 
tensively  in  our  analyses  has  been  that  of  a  largest  assured 
payoff  or,  somewhat  more  technically,  of  a  maximin  strategy. 
The  motivation  of  this  is  clear.  For  each  (patrol)  strategy 


-5- 


x  there  is  a  counter  (transit)  strategy  y  that  minimizes  the 
probability  of  first  detection.  The  value  of  this  probability 


is  given  by  min  f(x,  y);  it  is  the  worst  that  the  first  player 
can  gain  if  he  plays  x  .  He  then  chooses  x  so  as  to  maxi¬ 
mize  this  probability,  giving  him  a  probability  of  first  detec¬ 
tion  equal  to 


max  min  f(x,  y)  . 

x  y  ’ 

This  probability  is  a  "sure  thing"  for  him;  he  may  do  better 
if  the  second  player  does  not  counter  him  optimally  but  on  the 
average  he  will  do  at  least  as  well.  It  is  also  true  that  this 
is  the  highest  probability  of  first  detection  that  he  can  assure 
him  8  elf. 


.6. 


.11.  2  Analysis  of  Models;  A  Summary  of  Results 


We  are  now  in  a  position  to  describe  how  the  sequence 
of  models  that  have  been  analyzed  has  been  built  up.  The 
first  models  were  studied  with  the  fiist  solution  concept. 
Namely,  various  classes  of  patrol  strategies  X  were  played 
against  an  essentially  fixed  transit  strategy  y  .  This  tech¬ 
nique  is  appropriate  to  the  optimization  of  certain  key  para¬ 
meters  ir.  the  set  X  .  It  is  well  suited  to  such  questions  as; 
What  is  the  optimal  angle  for  a  bow-tie  patrol  against  transits 
in  one  direction?  This  technique  of  simple  optimization  against 
a  fixed  opponent's  strategy  (which  may  be  a  probability  mix  of 
a  class  of  strategies)  is  close  in  spirit  to  the  kinds  of  ques¬ 
tions  posed  in  gaming  and  simulation  approaches  to  the  sub¬ 
marine-submarine  problem. 

What  kinds  of  strategy  spaces  X  and  Y  have  been 
studied?  For  the  patrol  submarine,  the  emphasis  has  been 
largely  either  on  the  pattern  of  patrol  (varying  the  param  „ers 
of  the  pattern)  or  on  the  speed  of  the  patrol  (keeping  the  pat¬ 
tern  essentially  fixed).  For  the  transiting  submarine  the 
emphasis  has  been  largely  placed  either  on  the  location  of  a 
straight  line  transit  through  the  barrier  or  on  the  speed  of 
the  transit.  Each  choice  of  a  strategy  space  leads  to  techni¬ 
cal  problems  peculiar  to  that  choice  and  hence  to  assumptions 
designed  to  make  the  analysis  more  tractable.  (For  example, 
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if  we  are  comparing  patrol  patterns,  assumptions  can  be 
made  to  make  the  detection  depend  only  on  the  closest  ap¬ 
proach  distance  of  the  two  submarines.)  With  these  assump¬ 
tions,  the  following  results  were  obtained  (using  maximin 
probability  of  detection  as  the  principal  solution  concept). 

(1)  A  basic  proposition  which  has  been  widely  ignored 
in  ASW  analyses  is  that  there  can  be  no  sea  exercise,  no 
simulation,  and  no  game  theoretic  analysis  without  the  definite 
specification  of  three  factors: 

(a)  the  information  available  to  the  parti¬ 
cipants  as  to  the  context  of  the  action; 

(b)  the  technological  possibilities  (such 
as  speeds,  maneuverability,  and  detection 
capabilities)  available  to  the  participants; 

(c)  the  objectives  of  the  participants  in 
terms  of  quantified  measures  or  payoffs. 

In  Chapter  in  of  this  report,  "Game -Theoretic  Analyses  of  AfsW 
Problems',' the  effects  of  incorrect  specification  of  these  fac¬ 
tors  are  discussed  and  illustrated  by  examples.  It  is  shown 
that,  individually,  each  of  these  factors,  if  incorrectly  for¬ 
mulated,  can  lead  to  solutions  which  are  seriously  misleading. 
Specifically: 

(2)  The  analysis  of  speed  games  having  an  objective 
of  secure  detection  was  compared  with  the  simpler  one  of 
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detection,  and  various  approaches  were  suggested  for  the 
solution  of  these  more  complex  games.  Most  of  the  approaches 
led  to  the  same  mathematical  model:  the  "Difference  Game", 
which  is  partially  solved  in  Chapter  III,  Section  3  of  this  re¬ 
port.  Optimal  strategies  require  both  the  patroller  and  the 
transitor  to  use  slow  speeds. 

(3)  Extending  the  previous  MATHEMATICA  work  on 
matrix  games,  a  new  series  of  analyses  has  been  undertaken 
in  which  different  range  laws  are  compared.  This  leads  to  a 
more  general  definition  of  Secure  Sweep  Width  covering  the 
minimax  strategies  in  a  competitive  situation.  The  previous 
definition  was  applicable  only  for  "games  against  nature", 

i.  e. ,  situations  in  which  the  transitor  chose  at  random. 

(4)  Continuous  payoff  functions  representing  range 
laws  analogous  to  those  studied  in  the  matrix  games  were 
introduced,  and  it  was  shown  that  optimal  mixed  strategies 
retained  essentially  the  same  character  as  in  the  correspond¬ 
ing  matrix  games.  In  particular,  the  optimal  strategies  re¬ 
quired  only  a  finite  number  of  points  of  entry  (for  the  transi- 
tor)  and  points  of  defense  (for  the  patroller). 

(5)  A  repeated  game  with  incomplete  information  on 
the  transitor' s  side  was  analyzed.  This  was  an  attempt  to 
model  the  learning  aspects  of  the  detection-evasion  situation. 

It  led  to  the  result  that  the  game  should  be  played  (i.  e. ,  the 


-9- 


patrol  and  transit  strategies  chosen)  as  if  all  payoffs  were 
the  average  of  what  the  transitor  could  reasonably  expect, 
given  his  incomplete  information.  This  type  of  analysis  has 
the  potential  of  providing  an  important  link  between  informa¬ 
tion  and  strategy  in  repetitive  situations. 
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II.  3  The  Problem  of  Secure  Detection  vs  Detection 


The  patroller's  mission  is  in  the  first  place  to  detect 
the  transitors.  Other  events  may  then  follow:  localization 
and  classification  in  peacetime  (possibly  also  trailing);  finally, 
approach  and  kill  in  wartime.  But  there  is  a  considerable 
difference  between  detection  and  secure  detection  in  either 
peace  or  war.  Secure  detection  means  that  the  patroller  is 
not  detected  by  the  transitor  in  the  process  of  detecting  the 
transitor.  The  advantages  of  secure  detection  are  manifold; 
in  wartime  it  may  spell  survival,  while  in  peacetime  the 
information  on  enemy  submarine  movements  is  gained  with¬ 
out  revealing  our  submarine  movements  to  the  enemy. 

A  game -theoretic  model  which  uses  the  concept  of 
secure  detection  in  its  definition  of  objective  is  likely  to  dif¬ 
fer  substantially  from  a  model  based  either  on  detection  or  on 
evasion.  Not  only  is  the  measure  of  effectiveness  different, 
Nit  che  optimal  strategies  also  turn  out  to  be  quite  different. 

In  particular,  optimal  patrol  patterns  have  been  found  in  sim¬ 
ple  games  of  secure  detection  which  are  quite  different  com¬ 
pared  to  the  previous  one-way  street  of  detection  only. 
Unfortunately  the  mathematical  difficulties  e  analyzing  se¬ 
cure  detection  probabilities  are  rather  more  severe  than  in 
the  case  of  detection  probabilities.  We  have  considered  a 
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compromise  model  which  approximates  the  probability  of  se¬ 
cure  detection  by  a  weighted  combination  of  the  two  probabili¬ 
ties:  (i)  that  the  patroller  detects  the  transitor,  and  (ii) 
that  the  patroller  evades  detection  by  the  transitor.  In  a  sim¬ 
ple  analysis  of  patrolling  strategies  using  only  the  speed 
variable  it  has  been  found  that  for  purposes  of  maximizing 
detection  probabilities  the  patrolling  submarine  should  De 
moving  at  maximal  speed  for  a  fraction  of  the  patrol  cycle, 
while  for  purposes  of  evasion  he  should  never  exceed  that 
speed  at  which  self -noise  is  known  to  take  a  sharp  upward 
turn  as  a  function  of  speed.  What  recommendation  will  the 
game -theoretic  analyst  make  if  the  mission  emphasizes  both 
detection  of  transitor s  and  evasion?  The  answer  will  in  fact 
depend  on  the  relative  importance  of  the  opposing  parts  of 
the  objective. 

By  examining  a  hypothetical  case  in  which  it  is 
assumed  that  there  is  zero  probability  of  a  counterdetection 
following  detection,  we  show  in  Chapter  HI,  Section  1,  that 
the  compromise  objective  function  is  the  difference  between 
the  probabilities  of  detection  and  counterdetection.  This 
model  is  called  "The  Difference  Game"  and  a  generalization 
of  it  allows  a  utility  weight  to  be  applied  to  evasion  of  counter - 
detection.  In  Chapter  III,  Section  3,  the  solution  is  found  for 
all  cases  where  evasion  is  rated  at  least  as  important  as 
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detection.  The  solution  requires  both  the  patroller  and  the 
transitor  to  move  at  their  quiet  speeds  for  optimization  with 
respect  to  the  same  weighted  difference  function.  It  should 
be  pointed  out  that  the  assumptions  for  the  transitor  are  the 
same  as  for  the  patroller  but  with  signs  reversed.  The  tran¬ 
sitor,  in  other  words,  is  emphasizing  detection  of  the  patrol¬ 
ler  at  least -as  much  as  evasion  of  detection.  The  earlier 
analyses  of  speed  games  with  simple  detection  probability 
objective  functions  (see  refs  [5]  and  [6])  showed  that  optimal 
transit  speed  according  to  the  model  could  be  faster  than 
quiet  speed,  but  ignored  the  possibility  that  the  resulting 
secure  sweep  width  for  the  transitor  might  be  less  than  for 
the  patroller.  In  some  ca,ses  (depending  on  ratio  of  noise 
function  slopes)  this  actually  did  occur,  invalidating  the  solu¬ 
tion  on  logical  grounds.  The  present  analyses,  although  they 
do  not  solve  the  cases  where  the  patroller  rates  detection 
more  important  than  evasion,  correct  the  error  for  an  import¬ 
ant  class  of  game -theoretic  models  including  the  "equal  im¬ 
portance"  case  which  is  represented  by  an  unweighted  differ¬ 
ence  of  detection  probabilities. 


References  are  to  the  Bibliography  on  p.  109. 
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II.  4  The  Analysis  of  Position  of  the  Pat;  oiler  within  a  Zone 

(Matrix  Games) 

By  keeping  speed,  other  strategic  variables  and  exo¬ 
genous  parameters  constant,  it  is  v'u<s\b)e  +o  analyse  the  rela¬ 
tionship  between  detection  range  laws  and  the  eac?,„e  of  patrol 
positions  (and  transit  lanes).  The  zone  he.  divided  into  an 
indefinitely  large  number  of  lanes;  by  oc-  spying  ene  or  more 
of  these  lanes  the  patroller  is  said  to  ..  j.rlon  with? 
the  barrier  zone. 


n  transit  lanes 


Figure  H-  1 

No  question  of  patrol  path  configuration  in  two  dimensions  is 
involved.  Each  patrol  path  is  evaluated  only  in  terms  of  the 
fraction  of  patrol  cycle  time  spent  in  each  position.  In  ref.  [6]  , 
MATHEMATIC  A1  s  previous  analysis  of  the  position  strategies, 
two  types  of  strategy  were  distinguished. 

(i)  The  on- station  strategy.  The  patroller  occupies  a 
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lane  and  remains  close  to  its  center. 

■  (ii)  Continuous -motion  back -and -forth  strategy.  The 
patroller  moves  at  constant  speed  back  and  forth  from  end  to 
end  thus  dividing  up  a  patrol  cycle  uniformly  among  the  lanes. 

In  all  cases  the  transitor  was  assumed  to  come  straight 
through  the  zone  at  constant  speed.  Mixed  strategies  could 
be  interpreted  in  terms  of  a  unique  random  choice  of  a  pure 
strategy,  or  as  a  time  mixture  assigning  appropriate  fractions 
to  different  lanes.  -The  range  laws  considered  were  of  the 
cookie-cutter  type  except  for  the  3 -lane  analysis  of  Appendix 
IV  which  included  a  range  law  of  the  form  (0,  p,  1,  p,  0)  where 
the  probability  is  1  only  for  the  lane  actually  occupied.  The 
continuous -motion  patrol  assumed  that  the  probability  of  detec¬ 
tion  was  2q  (not  necessarily  the  same  as  p)  at  the  center  and 
q  at  the  edges. 

In  this  report  the  position-strategy  analysis  is  extended 
in  two  ways. 

(i)  A  more  realistic  range  law  is  assumed  and  the 
matrix  games  are  solved  in  Chapter  III,  Section  4.  The  range 
law  incorporates  a  linear  decrease  in  the  probability  of  detec¬ 
tion  symmetrically  outward  from  the  center,  dropping  grad¬ 
ually  off  to  zero  at  a  finite  range. 

(ii)  The  division  of  the  zone  into  a  finite  number  of 
lanes  and  positions  is  replaced  by  the  more  realistic  assump- 
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tion  of  a  continuum  of  lanes  and  positions  in  Chapter  IL.,  Sec¬ 
tion  5.  The  range  law  described  above  is  replac  i  by  its  con¬ 
tinuous  analog,  i.  e. ,  a  triangular-shaped  function.  Again  the 
extreme  range  is  finite,  but  there  is  a  continuous  variation  in 
probability  of  detection  unlike  the  discontinuous  cookie -cutter 
range  law. 

While  the  results  for  these  new  game -theoretic  models 
are  not  easily  described  in  a  few  words,  three  salient  features 
may  be  noted: 

First,  there  are  a  finite  number  of  favored  lanes  and 
positions,  and  in  the  case  of  the  matrix  games  this  number  is 
generally  less  than  the  total  number  of  lanes.  Even  when  the 
transito?  is  permitted  to  enter  the  zone  at  any  point  across  the 
entire  width,  optimal  minimax  strategy  calls  for  the  use  of 
only  a  small  number  of  lanes.  Similarly  and  symmetrically, 
the  patroller's  strategy  requires  him  to  ignore  the  majority 
of  positions  in  favor  of  the  few.  This  result  is  naturally  re¬ 
lated  to  the  assumed  form  of  range  law,  but  ref  [4]  shows  that 
it  is  qualitatively  similar  to  the  results  found  in  the  analysis 
of  continuous  convex  games  with  bell- shaped  kernels.  For 
these  games  also  there  are  only  a  finite  number  of  pure  stra¬ 
tegies  which  are  used  to  define  the  optimal  mixed  strategies. 

Second,  a  salient  feature  of  the  results  described  in 
ITT.  4  and  III.  5  is  the  general  behavior  of  the  value  of  the  game. 
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as  the  parameters  change.  Consider  a  zone  of  width  D  and 
the  triangular  detection  law  extending  from  x  -  d  to  x  +  d 
when  position  x  is  occupied  by  the  patroller.  Then 

is  a  good  approximation  to  the  value  of  the  game  in  most 

#  dl) 

cases  ;  thus  a  secure  sweep  width  (SSW)  of-  g- — -g-  is  as¬ 
sured  by  the  use  of  the  minimax  strategies  by  both  submarines. 
For  example,  if  d  is  very  much  smaller  than  D  ,  a  SSW  of 
approximately  d  is  achieved:  this  is  only  half  of  the  corres¬ 
ponding  figure  for  a  cookie- cutter  range  law.  Now  consider 
a  smaller  zone  (or,  equivalently,  larger  range  of  probable 
detection)  and  let  D  =  2d  .  Then  the  SSW  is  just  (2/3)d  . 

Finally,  we  have  also  extended  the  concept  of  secure 
sweep  width  to  a  strategic  confrontation.  The  "secure  sweep 
width"  concept,  introduced  in  ref  [8],  is  intended  to  be  a 
numerical  measure  of  the  patroller's  ability,  to  detect  the 
transitor  without  being  previously  counterdetected.  It  is 
defined  as  the  width  of  frontage  over  which  target  crossings 
are  equally  likely  at  all  points,  times  the  fraction  of  targets 
on  which  the  patroller  makes  secure  detection.  In  other 
words,  it  is  the  product  of  the  zone  width  D  with  the  proba- 

*  Except  when  d  is  nearly  as  large  as  D  . 

The  concept  is  extended  in  the  natural  way  for  non 
uniform  transits;  see  next  paragraph. 


bility  of  the  patroller  detecting  the  transitor,  given  that  the 
transitor  is  assumed  to  use  all  transit  lanes  with  equal  pro¬ 
bability.  This  last  assumption  makes  of  this  definition  a  non- 
strategic  measure  since  the  transitor  is  not  endowed  with  the 
possibility  of  making  strategic  choices  to  counter  the  patrol¬ 
ler's  strategy.  If  we  change  and  extend  the  definition  of 
SSW  by  allowing  the  transitor  to  adopt  strategic  choices  and 
assign  both  participants  min.imax  strategy  choices  then  the 
models  studied  give  SSW's  under  varying  range  laws.  Thus, 
if  a  patroller  is  assumed  to  make  secure  detection  of  any  tran¬ 
sitor  within  d  distance  an  SSW  of  approximately  2d  is  found. 
On  the  other  hand,  if  a  patroller  is  assumed  to  detect  a  tran¬ 
sitor  wi*h  probability  1  -  x/d  ,  if  x  <  d  is  the  distance  be¬ 
tween  the  subs,  and  not  detect  otherwise,  then  the  SSW  is 
approximately  dD/(d  +  D)  . 

It  appears  that  the  SSW  concept,  which  is  intended  to 
give  a  rough  measure  of  ability  to  detect,  can  in  fact  be  a 
truly  strategic  measure.  To  be  precise  this  suggests  the 
definition:  secure  sweep  width  is  the  width  of  frontage  over 
which  the  transitor  may  attempt  crossings  times  the  proba¬ 
bility  of  the  patroller  detecting  the  transitor  when  both  par¬ 
ties  use  minimax  strategies.  This  definition  will  remain  use¬ 
ful  even  when  the  transitor  is  known  to  employ  a  different 
strategy  so  long  as  the  patroller  is  using  the  minima.;  strategy. 
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Analogous  definitions  of  SSW  can  be  constructed  for  other 
strategy  concepts,  and  in  the  simplest  case  of  a  uniform 
patrol  the  new  definition  proposed  is  equivalent  to  the  previous 
definition. 
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II.  5  Conclusions 


Considerable  effort  has  been  devoted  to  the  study  of 

problems  concerning  submarine-submarine  detection  by 

mathematical  analysis,  gaming  and  computer  simulation. 

MATHEMATICA  has,  from  May  1965  to  the  present.been  re¬ 
ft 

viewing  for  ONR  the  question:  "How  can  the  theory  of  games 
be  fruitfully  applied  to  the  study  of  ASW  detection  and  evasion 
strategies?"  We  initially  considered  patrols  by  an  individual 
submarine  in  a  known  rectangular  zone;  subsequently,  while 
the  results  for  a  single  zone  have  been  extended  and  improved, 
attention  has  also  been  devoted  to  the  construction  of  barrier 
models  in  a- more  general  setting.  For  reasons  which  are 
detailed  below,  MATHEMATICA' s  investigators  have  concluded 
that  the  mathematical  theory  of  games  cannot,  at  present,  pro¬ 
vide  a  suitable  methodology  for  completely  analyzing  the  stra¬ 
tegic  alternatives  which  are  available  to  patrolling  submarines 
--  even  in  the  restricted  case  of  a  single  submarine  patrolling 
a  fixed  zone.  (By  analogy  with  economics,  we  might  express 
this  viewpoint  differently  by  the  statement  that  the  micro-  stra¬ 
tegies  are  inadequately  represented  in  any  mathematical  models 
which  can  be  formulated  at  present.)  We  anticipate  that  game- 
theoretic  methods,  in  combination  with  other  analytical  ap- 

*Under  contracts  Nonr  4937(00)  and  N00014-66-C-0215. 


proaches,  will  provide  useful  and  practical  insights  for  the 
macro-strategies  of  large  barrier  situations, rather  than 
complete  formal  solutions. 

These  conclusions  should  not  be  regarded  as  mainly 
negative,  because  only  the  most  elementary  of  practical  prob¬ 
lems  in  other  fields  possess  complete  mathematical  solutions. 
There  are  two  primary  difficulties  at  present: 

First,  the  information  which  MATHEMATIGA  has  ob¬ 
tained  on  the  determinants  of  patrol  and  transit  strategies  -- 
such  as  the  range  laws  of  sonar  detection,  the  effects  of  sea 
state  and  speeds  of  both  patrolling  and  transiting  submarines 
on  sonar  detection  and  counter  detection,  the  effects  of  depth 
of  the  submarine,  convergence  zones,  thermal  layer  and  so 
forth  --  seems  to  be  at  present  a  large,  complex,  and  poorly 
structured  body  of  data,  and  is  therefore  not  easily  provided 
as  an  input  to  the  conventional  game -theoretic  models  of  stra¬ 
tegic  conflict.  (Such  models  might  be  used  either  to  generate 
hypotheses,  or  to  test  and  verify  hypotheses;  the  data  require¬ 
ments  would  be  somewhat  different.) 

Second,  the  variables,  which  are  known  to  determine 
the  probability  of  a  detection  or  counterdetection,  are  numerous 

and  interdependent  in  complex  ways.  We  have  firm  information 

P  ' 

about  so  few  of  these  potential  interactions  that  a  useful 
mathematical  theory  is  almost  inaccessible  at  present.  Such  a 
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theory  apparently  would  require  the  analysis  of  strategies  as 
functions  of  several  control  variables;  the  resulting  mathe¬ 
matical  problems  are  difficult  even  to  formulate,  and  still 
more  difficult  to  use  as  a  basis  for  practical  results  of  im¬ 
portance  for  application  to  the  real  ASW  situation. 

Both  in  on-going  research  and  in  the  sea  exercises 
that  are  undertaker  to  test  the  various  recommended  modes 
of  patrolling  and  barrier  design,  a  large  body  of  information 
is  being  built  up  about  ASW  and  its  interface  with  oceanography. 
MATHEMATICAL  analysts,  on  the  basis  of  their  familiarity 
with  that  body  of  information,  have  found  that  the  current 
level  of  sophistication  in  understanding  and  structuring  that 
data  does  not  appear  to  permit  a  successful  mathematical 
analysis  of  the  patrol  and  transit  strategies  at  the  zonal  level. 
Several  game -theoretic  results  have  been  obtained  by  MATHE- 
MATICA  which  can  serve  as  a  beginning  for  a  mathematical 
theory  of  barrier  detection  in  the  large;  for  example,  mini¬ 
max  speeds  for  certain  detection-and -evasion  conflict  models 
have  been  derived  (with  other  parameters  held  constant).  These 
show  that,  the  greater  the  emphasis  on  evasion  (by  the  patroller) 
the  more  plausible  is  the  recommendation  of  slow  speed. 

Another  example  is  the  investigation  of  the  relationship  be¬ 
tween  probability  of  detection  and  range  law.  The  investigation 
has  led  to  an  extension  of  the  concept  of  Secure  Sweep  Width  to 
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the  strategic  situation.  Asa  third  example,  we  have  studied 
a  repeated  game,  the  individual  steps  of  which  are  opportu¬ 
nities  for  detection  and  counter -detection.  It  i3  assumed 
that  the  patroller  knows  more  about  his  own  range  of  detection 
under  the  prevailing  environmental  conditions  than  does  the 
transitor,  and  the  patroller  may  choose  whether  to  make  use 
of  this  information.  The  transitor  may  be  able  to  infer  (from 
observing  the  patroller's  strategic  moves  in  the  steps,  which 
are  called  "stage  games")  something  about  the  unknown  range 
law  of  the  patroller.  This  analysis  shows,  in  one  particular 
case  of  interest: 

(a)  that  the  patroller  cannot  profit  by  using  his  infor¬ 
mation  in  a  strategic  mode,  and 

(b)  that  the  transitor,  to  optimize,  must  play  the 
average  game  expected  under  his  prior  beliefs  about  the 
patroller's  detection  capability. 

Such  results  are  obtained  from  zonal  analysis,  but  be¬ 
cause  of  the  previously  described  complexities  they  are  really 
of  limited  value  for  the  improvement  of  ASW  patrols,  within 
the  zone.  We  feel  that  the  generalization  of  the  task  to  the 
overall  discussion  of  barrier  strategy  would  permit  the  use 
of  the  results  in  an  interesting  and  potentially  valuable  theory. 
For  instance,  if  the  relative  positioning  of  a  number  of  sub¬ 
marines  of  differing  detection  capabilities  were  analyzed,  the 
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methodology  of  MA THEMA TICA '  s  described  work  on  Matrix 
Games  could  be  brought  to  bear. 

Game  theory  requires  precise  detailed  infromation  on 
the  three  factors  mentioned  above,  (and  discussed  in  greater 
detail  in  the  next  Chapter)  for  its  successful  application  to 
ASW.  Whenever  some  of  this  information  is  lacking,  the 
partial  analysis  will  yield  only  qualitative  results.  Whereas 
such  results  may  provide  meaningful  insights  into  the  princi¬ 
ples  of  a  rather  large  systems  problem  such  as  barrier  de¬ 
sign,  they  do  not  at  present  appear  to  be  very  practical  at 
the  level  of  components  such  as  a  patrol  zone.  For  a  suc¬ 
cessful  strategic  analysis  of  the  zonal  problem  the  require¬ 
ments  are  at  least  the  following: 

(a)  The  functional  relation  between  detection  range 
and  its  determinants  --  patrol  speed,  oceanographic  condi¬ 
tions,  target  speed,  depth, etc.  --  should  be  known  and  para¬ 
metrized. 

(b)  The  forms  and  limits  of  strategic  behavior  allowed 
to  the  patrolling  and  transiting  submarines  must  be  clearly 
spelled  out.  Excessive  complexity  and  variety  here  creates 
difficulties  for  the  analyst. 

(c)  The  objectives  must  be  formally  stated,  inasmuch 
as  the  solutions  will  depend  critically  on  them.  Several  alter¬ 
native  objectives  may  be  used,  requiring  separate  analyses  and 
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leading  of  course  to  distinct  solutions.  In  view  of  the  variety 
of  missions  to  which  a  barrier  submarine  is  assigned  in 
peacetime  and  war,  there  is  clearly  a  necessity  for  several 
of  these  alternative  analyses. 
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II.  6  Recommendations 


Asa  result  of  the  conclusions  presented  above, 
MATHEMATICA  recommends : 

(i)  that  the  attempt  to  model  the  single  patrol  zone 
ASW  problem  as  a  game  be  discontinued  for  the  present; 

(ii)  that  further  research  be  conducted  on  the  use  of 
mathematical  models  of  ASW  barriers  as  a  whole  at  a  higher 
level  of  aggregation  than  the  zonal  unit; 

(iii)  that  the  work  which  MATHEMATICA  and  other 
investigators  have  begun,  on  the  effects  of  range  law  detec¬ 
tion  on  optimal  strategies,  be  continued;  and 

(iv)  that  the  above  work  should  make  use  of  recent 
advances  in  the  theory  of  Repeated  Games  of  Incomplete 
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III.  Game -Theoretic  Analyses  of  ASW  Problems 


III.  1  Games  with  Different  Objective  Functions 


Introduction 

The  purpose  of  this  section  is  to  discuss  several 
basic  questions  which  underlie  any  analysis  of  the  conflict 
situation  involving  one  or  more  submarines  patrolling  a  bar¬ 
rier  and  one  or  more  submarines  attempting  a  transit  across 
the  barrier.  These  questions  arise  whether  the  analysis 
proceeds  via  sea  exercises,  simulation,  gaming,  or  game 
theoretical  models.  They  have  been  largely  ignored  in  pre¬ 
vious  ASW  analyses  and  our  purpose  is  to  show,  by  3imple 
examples,  how  this  neglect  has  influenced  the  results  ob¬ 
tained  and  restricted  their  practical  usefulness. 

A  first  step  in  any  analysis  of  the  patrol-transitor 
conflict  is  the  definition  of  the  actions  open  to  the  parties 
involved  and  their  objectives  in  the  conflict.  This  is  true 
whatever  the  nature  of  the  analysis.  If  we  are  designing  a 
sea  exercise,  we  must  give  the  ’'rules  of  the  game"  to  the 
two  sides  and,  in  general,  these  will  circumscribe  their 
range  of  free  action  rather  sharply.  The  rules,  as  given  to 
the  participants,  will  consist  of  three  parts:  (1)  the  types  of 
information  permitted  to  them  either  from  their  own  equip¬ 
ment  or  observations  or  from  outside  sources;  (2)  the  range 
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of  actions  that  are  permitted  to  them  at  various  points  of  the 
situation  as  a  function  of  the  information  available  to  them  at 
those  points;  (3)  the  object  of  each  antagonist  in  the  exer¬ 
cise,  preferably  in  terms  of  some  index  of  merit  which  could 
be  computed  at  the  end  of  a  contest.  (It  may  happen  that  the 
index  of  merit  could  only  be  evaluated  by  an  umpire  who  will 
have  more  information  than  any  of  the  individual  participants.) 

To  be  more  specific  about  these  three  ingredients  in 
a  sea  exercise,  we  may  consider  a  simple  patrol -transitor 
exercise  run  on  a  rectangular  area  of  the  ocean,  assumed  to 
be  oriented  so  that  the  sides  run  North-South  and  East-West. 
For  the  patrol  submarine,  under  (1)  we  may  specify  the 
exact  location  of  the  patrol  rectangle,  the  length  of  time  of 
the  exercise,  the  fact  that  one  submarine  of  a  given  type  will 
be  attempting  a  single  transit  through  the  area  from  East  to 
West  at  given  depth  sometime  during  the  exercise,  and  that 
the  patrol  will  start  the  exercise  at  a  given  point  in  the  area. 
Furthermore,  we  may  specify  the  types  of  detection  equip¬ 
ment  that  may  be  used  by  the  patrol  submarine  and  thus  the 
kinds  of  information  about  its  own  location  and  speed  and 
about  the  transitor' s  location  and  speed.  All  of  these  factors 
constitute  a  part  of  the  patrol's  knowledge  of  the  extensive 
form  of  the  exercise;  indeed,  that  part  which  is  exclusive  of 
the  alternatives  open  to  him  and  the  transitor  01  based  on 


that  information.  As  for  similar  information  to  be  provided 
to  the  transit  submarine  in  the  exercise,  we  may  tell  him 
the  location  of  the  rectangle,  the  length  cf  time  of  the  exer¬ 
cise,  the  fact  that  one  submarine  of  given  type  will  be  patrol¬ 
ling  somewhere  in  the  area  at  a  given  depth.  Furthermore, 
we  may  specify  the  types  of  detection  equipment  available  to 
the  transitor  and  thus  the  kinds  of  information  about  its  own 
location  and  speed  and  about  the  patrol's  location  and  speed. 
Again  theso  constitute  the  transitor1  c  knowledge  of  the  exten¬ 
sive  form  of  the  exercise,  exclusive  of  the  alternatives  open 
to  him  or  based  on  that  information. 

As  for  ..(2),  the  range  of  actions  open  to  the  two  sub¬ 
marines,  we  may  specify  them  in  great  detail,  such  as  only 
allowing  the  transitor  straight-line  crossings  of  the  rectangle 
at  constant  speed,  or  we  may  allow  considerable  choice,  such 
as  zig-zag  paths  at  varying  speeds  depending  on  information 
reaching  the  transitor  through  its  detection  equipment.  Simi¬ 
lar  comments  apply  to  the  patrolling  submarines. 

The  all-important  result  of  the  combination  of  (1)  and 
(2)  is  the  concept  of  a  strategy  for  a  submarine  in  an  exercise. 
This  concept  reflects  the  planned  interaction  of  information 
with  the  freedom  of  action  allowed  to  the  submarine.  Without 
the  specification  of, or  explicit  assumptions  concerning,  (1) 
the  information  about  the  extensive  form  of  the  conflict  and 
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(2)  the  restriction  of  the  range  of  action  open  to  the  partici¬ 
pants,  there  can  be  no  exercise,  no  simulation,  or  no  game- 
theory  mode’.. 

These  ingredients  are  by  no  means  the  whole  story, 
for  they  only  speak  to  what  the  submarines  know  and  can  do. 
They  say  nothing  about  why  they  are  doing  it,  what  their  ob¬ 
jectives  are  or,  in  game-theoretic  terms,  what  the  payoff  is. 
If  we  return  to  the  question  of  an  operational  exercise,  we 
are  concerned  with  a  measure  of  the  performance  of  the  par¬ 
ticipating  submarines.  A  typical  specification  for  the  patrol 
submarine  would  be  to  say  that  he  wins  if  he  detects  the 
transit  without  being  detected  himself,  that  the  transitor  wine 
if  he  crosses  the  rectangular  area  without  being  detected  and 
that  the  exercise  is  a  draw  if  both  submarines  correctly  ue- 
tect  each  others'  presence.  If  we  are  dealing  with  a  larger 
number  of  transits  than  one,  we  may  use  the  ratio  of  first, 
detections  by  the  patrol  to  the  number  of  transits  as  a  figure 
of  merit  to  measure  the  performance  of  the  patrol  side  of  the 
exercise.  Both  the  simplicity  of  these  suggestions  and  the 
dependence  on  the  structure  of  the  underlying  actions  and 
information  possibly  available  to  the  sides  in  the  exercise 
mak*-  '•  ict  crystal  clear:  the  optimal  behavior  of  the  par¬ 
ties  ’.nts  may  vary  significantly  if  we  change  the  objective 
function.  Fundamentally  it  is  this  phenomenon  that  we  wish 
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to  illustrate  in  this  chapter. 

To  recapitulate,  there  can  be  no  exercise,  simulation, 
or  game  without  the  specification  of  three  factors: 

(1)  the  information  available  to  the  participants  as  to 
the  rules  of  the  game; 

(2)  the  technological  possibilities  (that  is,  the  speeds, 
maneuverability,  and  detection  capabilities)  avail¬ 
able  to  the  participants; 

(3)  the  objectives  of  the  participants,  preferably  in 
the  form  of  numerical  measures  that  could  be 
computed  by  an  observer  of  the  complete  action. 

Incorrect  specification  of  any  one  of  these  can  lead  to 
seriously  incorrect  calculation  ox  optimal  behavior  as  we 
shall  now  show  by  means  of  examples. 

Misinformation  about  the  Extensive  Form 

The  difficulty  described  by  the  title  of  this  subsection 
may  be  encountered  in  a  number  of  forms  and  may  differ  in 
importance  according  to  the  situation.  That  it  is  of  practical 
relevance  is  easily  seen:  surely  it  is  unrealistic  to  assume 
that  the  transiting  submarine  has  an  accurate  map  of  the  bar¬ 
rier  with  its  complete  geometry  and  the  numbe’  and  type  of 
patrol  submarines  present  in  it  in  a  given  period  of  time,  and 
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equally  unreasonable  to  assume  that  the  patrolling  submarines 

will  know  the  number  and  type  of  the  submarines  attempting 

to  transit  through  the  barrier.  This  second  possibility  is 

illustrated  by  the  following  example,  patterned  on  a  game 

* 

which  we  have  discussed  previously: 

First  Extensive  Form:  Let  the  payoff  matrix 


T1 

T2 

T3 

p 

1 

0.  25 

0 

1 

p> 

0.  25 

1 

0.25 

P3 

0 

0.  25 

1 

P4 

0.25 

0.5 

0.  25 

The  patroller's  strategies  are: 

P^  =  patrol  in  Northern  boundary  cell 
P ^  =  patrol  in  central  cell 

P3  =  patrol  in  Southern  cell 

P4  =  continuous  motion  patrol 

The  transitor's  strategies  are: 

Tj  =  transit  through  Northern  cell 

T2  =  transit  through  central  cell 


Pp.  44-51  of  ref  [6]. 
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T3  =  transit  through  Southern  cell  . 

The  underlying  assumption  is  that  one  transit  will  be  attempt¬ 
ed  and  the  payoff  is  the  probability  that  all  transits  will  be 
detected. 

This  example  is  easily  solved.  The  optimal  strategies  are 


P* 

= 

3/8  Px  +  1/4  P2  +  3/8  P, 

T# 

= 

3/8  Tj  +  1/4  T2  +  3/8  T3 

and  the  minimax  value  is  7/l6  . 

Now  let  us  alter  the  game  by  having  the  transitor  at¬ 
tempt  to  pass  at  least  one  of  two  submarines  through  the 


barrier.  If  we  denote  the  resulting  pure  strategies  for  the 
transitor  by  (T^»  T^)  for  1  <  i  <  j  <  3  ,  then  the  payoff 
matrix  becomes: 


(T^Tj) 

<TrT2) 

(Tj*  T3) 

(t2,t2) 

(t,,t3) 

(t3,t3) 

pi 

1 

0.25 

0 

0.0625 

0 

0 

P 

2 

0.0625 

0.25 

0. 0625* 

1 

0.25 

0.0625 

P3 

0 

0 

0 

0.0625 

0.25 

1 

P 

4 

0.0625 

0.  125 

0.0625 

0.  25 

0.  125 

0.0625 

Here,  as  before,  the  payoff  is  interpreted  as  the  probability 
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that  all  transits  are  detected. 

This  matrix  has  a  saddlepoint  solution  (indicated  by 
the  asterisk)  consisting  of 


T  =  (TrT3) 

with  minimax  value  .  This  has  the  obvious  interpretation 
that  the  patrol  should  sit  on  the  center  station  while  the  tran- 
sitor's  submarines  should  run  the  sides  of  the  barrier.  If 
the  patrol  submarine  plays  the  optimal  strategy  for  the  game 
with  but  one  transit  submarine,  then  the  probability  that  he 
detects  all  of  the  transits  is  reduced  to  ^  ! 

We  have  not  solved  the  problem  of  constructing  rea¬ 
sonable  patrol  strategies  when  the  number  of  submarines 
attempting  transit  is  unknown.  The  problem  is  clearly  but  one 
aspect  of  a  central  theme  of  this  report,  namely,  that  the 
well-defined  area  games  may  mislead  if  unrelated  to  a  larger 
context. 

Mis  specification  of  the  Technological  Possibilities 

In  our  definition  of  the  context  of  the  analysis  of  a 
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transit-patrol  conflict,  we  included  under  the  category  of 
technological  possibilities  open  to  the  participants  such  fac¬ 
tors  as  speed,  detection  capability,  and  patrol  or  transit  pat¬ 
tern.  Naturally,  in  any  theoretical  modeling  of  the  situation 
we  will  use  the  best  possible  estimates  of  speed  and  will  at¬ 
tempt  completeness  (within  the  context)  in  listing  the  geomet¬ 
ric  patterns  allowed  to  the  patrolling  and  transiting  subma¬ 
rines.  Even  if  we  assume  that  these  parameters  are  known 
accurately,  the  remaining  variability  in  the  detection  capabili¬ 
ties  are  such  to  render  the  results  suspect.  If  we  deal  with  sea 
exercises,  the  variability  of  sea  state,  equipment  perform¬ 
ance,  and  the  small  samples  obtained  render  most  results 
statistically  insignificant.  If  we  consider  simulation  or  gam¬ 
ing  experiments,  the  lack  of  reliable  hypotheses  to  program 
for  detection  renders  the  results  unreliable  when  interpreted 
as  guides  to  practice.  If  we  are  attempting  game  theoretical 
analyses,  the  assumptions  about  detection  capability  can  alter 
the  conclusions  radically  (even  when  all  other  parameters  are 
held  constant. ) 

This  phenomenon  can  be  illustrated  by  a  simple  ex¬ 
ample  of  the  same  type  used  in  the  previous  subsection. 
Consider  the  general  game  matrix: 
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Ti  T2  T3 


where  the  strategies  have  the  same  interpretation  as  in  the 
previous  subsection.  If  we  consider  the  case  of  detection 
probabilities  p  =  q  =  1/4  as  before,  the  optimal  strategies 


3/8P1  +  l/4P2  +  3/8P3 


3/8T1  +  l/4T2  +  3/8T3 


with  minimax  probability  of  detection  equal  to  If  we  vary 
the  parameters  of  detection  through  the  interval  <  p  = 
q  <  ,  the  optimal  strategy  for  the  patroller  varies  in  the 

following  manner: 


p  -  r^rpi  +  t^-p2  +  rr^p3  • 


However,  the  endpoint  p  =  q  =  this  formula  yields 


P  =  ■jPj  +  jP^  »  whereas  the  strategies  P^  and  P^ 
dominate  this  weakly.  If  this  problem  were  presented  at  the 
endpoint  p  =  q  =  j  the  payoff  matrix  is 


Common  sense  dictates  the  choice  of  P2  or  P^  (the  center 
station  or  the  continuous  patrol).  Nevertheless  with  p  =  q 
=  -j  -  €  ,  these  strategies  are  hardly  used  at  all: 


fnrpi 


i 

1+e 
1  +  4* 


Thus  a  slight  change  in  the  detection  specification  can  cause 


a  radical  change  in  the  strategy  proposed  as  optimal. 


Mis  specification  of  the  Objective  Function 


This  is  perhaps  the  most  important  part  of  formulating  a 
theoretical  model  of  any  process  and  yet  many  previous  efforts 


in  ASV  seem,  in  retrospect,  to  have  been  extremely  oversim¬ 
plified.  Consider  some  of  the  possible  objectives  of  a  barrier: 

(1)  Make  a  statistical  census  of  the  transiting  submarines 
passing  through.  This  in  turn  could  have  two  possible  varia¬ 
tions; 

(la)  The  purpose  of  the  survey  could  be  an  absolute 
census  of  the  transiting  submarines,  in  order  to  keep 
a  rough  count  of  the  opposing  force  in  various  areas. 

(lb)  The  purpose  of  the  survey  could  be  a  check  on 
large  changes  in  activity.  Thus,  if  the  average  has 
been  1  transit  per  week  a  change  to  2  per  week  might 
not  be  significant,  while  10  in  one  week  would  consti¬ 
tute  an  important  change  in  the  situation. 

(2)  Make  a  complete  census  of  the  number  and  type  of 
submarines  crossing  the  barrier.  (The  barrier  might  or  might 
not  be  interested  in  whether  the  patrol  submarines  are  in  turn 
detected  by  the  transitors.) 

(3)  Prevent  all  transits.  (The  patr  Tling  submarine  will 
necessarily  follow  a  detection  by  other,  more  active,  phases  of 
identification,  pursuit,  and  attack. ) 

Even  this  simple  enumeration  of  possible  objectives  for 
the  patrolling  submarine  should  make  clear  the  inadequacy  of 
the  "probability  of  first  detection"  as  an  objective  function. 
However,  the  difficulty  is  even  more  serious  than  this.  Namely, 


it  is  not  clear  that  the  essence  of  the  real  situation  can  be 
reflected  by  an^  zero-sum  model.  For  example,  if  a  mis¬ 
sion  is  assigned  to  one  side,  the  other  side  may  take  "pre¬ 
vention  of  that  mission"  as  its  criterion;  so  far,  the  game  is 
zero-sum.  But  if  one  side  risks  loss  of  a  submarine,  the 
other  side  is  likely  to  be  prima  facie  indifferent  to  whether 
the  sub  is  lost;  this  is  a  non-zero-sum  aspect.  Under  these 
circumstances,  any  analytical  approach  must  acknowledge  that 
the  situation  is  one  of  partial  competii  >n,  and  must  somehow 
deal  with  the  inherent  conceptual  difficulties. 

The  unavoidable  conclusion  of  this  discussion  is  that, 
although  the  available  methods  may  analyze  local  situations 
and  produce  optimal  solutions,  these  are  likely  to  be  mis¬ 
leading  if  not  erroneous  without  more  realistic  formulations 
of  the  global  contexts  of  which  they  are  a  part. 


III.  2  Fir st -Detection  Games 

An  important  step  in  the  further  development  of  sub- 
marine-versus-submarine  detection  games  is  the  determina¬ 
tion  of  improved  objective  fu.  ctions.  In  a  previous  MATHE- 
MATICA  study  [6]>  consideration  was  given  to  a  detection 
problem  in  which  a  patroller  attempts  to  maximize  his  pro¬ 
bability  of  detecting  a  transitor,  while  the  transitor  attempts 
to  minimize  that  probability.  This  objective  function  was 
also  used  by  Wagner  and  Associates  in  [5];  optimal  strategies 
were  found  for  that  detection  problem.  Unfortunately,  those 
strategies,  although  they  maximized  the  patroller's  proba¬ 
bility  of  detecting  the  transitor,  did  not  assure  that  it  was  a 
first  detection;  realistically,  if  the  transitor  detected  the 
patroller  first,  the  transitor  could  probably  avoid  detection 
by  the  patroller.  The  previous  objective  function  would  there¬ 
fore  only  be  applicable  if  (for  some  reason)  the  transitor  had 
fixed  his  r  jute  and  speed  in  advance,  and  could  not  alter  them 
even  if  he  detected  the  existence  and  location  of  a  patroller. 

In  this  section  several  games  will  be  examined  which 
incorporate  an  improved  objective  function,  namely,  the  pro¬ 
bability  that  the  patroller  detects  the  transitor  before  he  is 
himself  detected.  These  games  make  the  essential  assump¬ 
tion  that  the  range  at  which  detection  first  occurs  is  a  ran¬ 
dom  variable;  otherwise  we  could  not  represent  the  random 


events  "transitor  detects  patroller  first"  and  "patroller  de¬ 
tects  transitor  first." 

In  order  to  facilitate  th~.  development  we  will  first  set 
down  a  sequence  of  definitions  which  will  also  apply  to  the  follow¬ 
ing  section,  III.  3. 

1.  Events 

Attempted  Transit  (AT)  An  occasion  for  possible 
detection  of  an  enemy  submarine  attempting  to  transit  the  bar¬ 
rier,  which  is  patrolled. 

Detection  The  .reception  of  signals  (on  sonar  or  other 
equipment)  which  have  in  fact  originated  from  an  enemy  submarine, 
although  they  may  not  have  been  identified  as  such  with  certainty. 

2.  Range  Variables 

Closest  point  of  approach  (CPA)  We  shall  use  D  for 
the  random  variable  which  corresponds  to  the  distance  between 
the  8ubma:.nes  at  the  CPA,  when  the  AT  occurs  at  a  random 
point  a  .  time.  D  is  also  dependent  on  the  two  speeds,  the  patrol 
path  and  the  transit  path.  We  will  explicitly  treat  only  the  speeds 
here. 

Maximum  range  for  detection  Two  random  variables 
are  defined  here,  S  and  T.  They  are  symmetrical  with  respect 
to  patroller  and  transitor  submarines. 

S£  r  mum  range  at  which  the  patroller  is  capable 
of  detecting  the  t  .isitor  under  the  conditions  which  prevail  on 
a  specific  AT. 
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T  =  maximum  range  at  which  the  transitor  is  capable 
of  detecting  the  patroller  under  the  conditions  which  prevail  on 


Similarly  the  transitor  may  wish  to  maximize  the  pr  /bability 
of  a  secure  detection  of  the  patroller,  P(D  <  T,  S  T)  . 
Notice  these  two  objectives  are  not  complementary  since 
P{D  >  S  or  T  >  S)  =  1  -  P(D  <  T  and  S  <  T)  . 

To  minimize  the  probability  of  re  counter  de¬ 
tection.  We  merely  remark  that  the  trans  \y  wish  to 

minimize  the  probability  F(D  <  S  ,  T  ~  hich  is  the 

complementary  objective  to  the  patroller's  .he  previous 
paragraph.  It  seems  unlikely  that  the  patroller  would  concern 
himself  with  minimizing  P(D  <  T  ,  S  <  T)  explicitly,  al¬ 
though  that  would  partly  be  a  result  of  his  previous  objective. 

4.  The  Games 

We  consider  in  this  section  and  the  following,  a 
class  of  detection  games  with  one  of  the  above  objectives,  or 
some  objective  derived  from  them  by  compounding.  The  stra¬ 
tegy  spaces  are  simple:  the  normalized  speeds,  u  and  v 
which  vary  over  the  intervals  [u  ,  u  ] ,  [v  ,  v  ]  .  It  is 
assumed  throughout  that  both  submarines  follow  straightline 
courses  at  right  angles  to  each  other  in  the  present  simple 
model. 


It  is  important  to  note  that  the  game  stops  when 
either  an  uhdetected  transit  is  completed,  £r  a  detection 
(counterdetection)  occurs.  For  the  subsequent  events  in  the 
latter  case  a  new  game  model  is  started,  which  wall  have 
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I 

1 
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different  features.  Strictly  speaking,  the  game  may  stop  even 
earlier  in  the  transit  sequence:  after  the  transitor  has  passed 
the  CPA  undetected.  However,  this  distinction  makes  no  dif¬ 
ference  to  the  analysis,  so  we  may  ignore  it. 

As  previously  developed  ,  the  probability  of  the 
distance  between  the  patroller  and  the  transitor  ever  falling 
below  r  during  an  attempted  transit  is : 


P(D  <  r)  =  min  1). 


where 


M  = 


1  + 


u 

T 

v 


and  where  u  and  v  are  normalized  speeds  for  the  patroller 
and  the  transitor  respectively  while  w  is  the  half-width  of 
the  barrier.  Let 


dFD(r) 

FD(r)  =  P(D  <  r)  and  fD(r)  =  — 

dFq(r) 

Fg  (r)  =  P(S  <  r)  and  fg  (r)  =  — 


dFT(r) 

FT(r)  =  P(T  <  r)  and  fT(r)  =  — 


See  page  31  of  Reference  [6]. 
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We  defiue  p(u,  v)  as  the  probability  of  a  secure  detec¬ 
tion  by  the  patroller  wnen  th*>  respective  speeds  are  u  and 
v.  It  is  this  objective  which  we  shall  examine  in  more  detail 
now.  As  mentioned  above,  the  patroller  attempts  to  maximize 
and  the  transitor  to  minimize: 


p(u,  v)  =  P(D  <  S,  T  <  S)  . 


Ii  we  assume  that  D  and  S  are  independent  we  can  write 


(2) 


* 

p(u,v)  =  J  FD(r)FT(r)fg(r)dr 


„w/M 


O-*- 

r 


=  j0  (Mr/w)  F  T(r)fg(r)dr  +j  FT(r)fs(r)dr 

w/m 


If  w/M  is  large,  then  an  approximation  for  equ.  (2)  will  be: 


(3) 


P(u.v)  =  —■  ^  rFT(r)fg(r)dr  , 


which  is  of  the  form: 


(4) 


P(u,v) 


E(s|t<S)P(T<S)  . 


If,  furthermore,  T  and  S  have  zero  variances  we  find 
T  <  S  if  and  if  only  cv  <  u  ,  where  c  ii  the  ratio  of  the 
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transitor  and  the  patroller  noise  slopes.  In  this  case: 


E(s|t  <  S)  =  Kecv"u  ,  and: 


(5)  p(u,  v)  a 


2  2  l!Z 

|(1  +  U  /v  )  \  „  CV-U  .£  ^ 

P - — i -  Ke  if  cv  <  u 


if  cv  >  u 


which  is  equivalent  to  the  results  obtained  previously  in  [5]. 


Exponential  Example 


As  an  example,  assume  that  S  and  T  are  indepen¬ 
dently  distributed  according  to  the  negative  exponential  pro¬ 
bability  law,  i.  e. ,  that  for  r  >  0: 


E(S)  = 

E(T)  =  a2_1 

'alr 

P(S  <  r)  =  1  -  e  ,  and 
-a2r 

P(T<  r)  =  1  -  e  . 


Then: 


/T-  -a7r  -a,r 

P(u,v)  =  —  j  r(l  -  e  )axe  1  dr 
J0 


'alr  ,  f  “(al+a2>r  ! 

re  dr  -  i  re  dr  i 


M_  _  _1_  (2+b)b 

w  al  '  (1+b)2 


where  b  =  —  . 

ai 

A  particular  version  of  this  example  is  obtained  by  also 
assuming  that  the  S  and  T  distributions  have  means  equal  to  the 
thresholds  for  detection  and  counterdetection: 

pin  i  1  CV-U  ,  -,,m\  1  rr  u/c-V 

E(S)  as  -  =  Ke  ,  and  r-(T)  =  —  =  Ke 

al  a2 

The  payoff  probability  reads: 


If  p(u,  v)  were  shown  to  be  convex  in  either  u  or  v,  optimal 
(minimax)  strategies  would  be  easily  obtained.  The  function  in 
(7)  has  been  numerically  evaluated  for  c=0,  1,  0.2,  0.5,  0.75, 

1.0,  1.5,  2,  3,  4,  10  and  u,  v  =  0. 1(0.  1)  1.5.  Table  III.  2.1  shows 
the  values  for  the  case  c  =  1.0,  In  every  case,  u  =  0.1,  v  =  0.  1 
is  found  to  be  a  saddlepoint  of  the  matrix.  If  uq  and  vq  were 
different  from  0.  1,  the  saddlepoint  effect  would  still  be  preserved 
since  the  function  p(u,  v)  is  monotonic  decreasing  in  u  and  increasing 
in  v  throughout  the  range  considered. 


(7)  p(u.v)  .  M.  Kacv~u  wh=re  b  =  e(c+1,lv-“/c> 

V^l+b)  v 
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III.  3  A  Difference  Game 


The  objective  function  used  for  submarine-versus- 
submarine  detection  games  in  the  previous  MATHEMATICA 
report  [6]  and  also  in  Wagner  [5]  was  the  probability  of  detec¬ 
tion  of  the  transitor  by  the  patroller,  as  a  function  of  their 
speeds.  Parameters  representing  the  range  law  and  oceano¬ 
graphic  conditions  were  held  constant.  Using  the  notation  of 
[5  ]; if  u  and  v  are  normalized  speeds,  then  the  model,  which 

p 

we  shall  call  the  "Patroller's  Speed  Game"  G  ,  is  de  ribed 
as  follows: 


max  min 

u  C  u  <  u  v  <  v  <  v 
o  —  —mo—  m 


FP(u,  v)  = 


cv 


-4 


1.1.  2/  2 
1  +  U  /V 


where  c  is  a  positive  constant.  It  is  obvious  that  there  is  a 

symmetrical  position  for  the  transitor  to  take  in  this  game: 

this  results  in  a  model,  which  we  shall  call  the  "Transitor1  s 
T 

Speed  Game",  G  which  is  described  as  follows: 


min  max 

u  <  u  <  u  v  <  v  <  v 
o~  —mo—  m 


_T.  .  u/c-v 

F  (u,  v)  =  e  ' 


l+u2/v2 


Note  that  the  min  end  max  variables  are  reversed.  In  both 
games,  c  =  a/b  is  a  positive  constant  and  a  and  b  are  the 
slopes  of  the  noise  functions  for  the  transitor  and  the  patroller 


-50- 


in  terms  of  their  speeds: 


0 

0 

0 


1 


if 


I 

1 


N-Ju)  =  b{u-u  )  +  k  if  u  <u<u  , 

P  'o'  o  o  —  —  m 

N_(v)  =  a(v-v  )  +  '  ifv  <v<v 

T  'o'  o  o  —  —  m  * 

At  speeds  below  uq  ,  the  transitor's  noise  output  Np(u)  is 
(roughly)  constant  and  sivri'arly  for  N^,(v)  below  .  The 
maximum  achievable  speeds  for  the  patroller  and  the  transi- 
tor  are  assumed  to  be  u  and  v  respectively. 

The  analysis  of  a  patrol  speed  strategy  based  on 

p 

G  is  faulty  if  it  yields  solutions  which  in  fact  result  in  a 
situation  where  the  transitor  is  more  likely  to  secure  first 
detection.  An  example  of  this  error  occurs  in  the  game  de¬ 
scribed  in  Figure  6  of  ref.  [5  ],  which  assumes  that  a  =  b  ; 
the  analysis  leads  to  "optimal"  normalized  speeds  for  the 
patroller  of  0.  7500  and  for  the  transitor  of  0.  6054.  The  value 

p 

for  the  patroller's  game  G  is  1.  38;  but  the  transitor's 
T 

game  G  has  a  value  of  1.  74.  This  means  that  the  Iransi- 
tor  can  assure  himself  of  a  larger  "secure  sweep  width"  and 
accordingly  also  of  first  detection.  Given  the  broad  situation, 
there  is  no  reason  for  the  patroller  to  adopt  the  optimal  stra¬ 
tegy  of  G*3. 
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Our  formulation  of  a  Secure  Detection  Speed  Game  in 
the  first  place  will  involve  objectives  for  the  patroller  and  the 
transitor  which  are  not  complementary;  hence,  a  nonconstant- 
sum  game.  In  the  subsequent  analysis  we  will  reformulate 
the  game  in  such  a  way  that  it  becomes  a  constant-sum  game. 
Then  various  simplifying  assumptions  will  be  introduced  which 
lead  to  mathematically  tractable  payoff  functions. 

Consider  first  the  game: 

(SD^)  patroller  to  maximize  P(T  <  D  <  S)  ; 
transitor  to  maximize  P(S  <  D  <  T)  . 

This  game  assigns  symmetrical  objectives  to  the  patroller  and 
the  transitor;  both  are  to  choose  strategies  which  maximize 
the  probability  of  secure  detection  of  the  enemy.  The  solution 
of  this  game  is  conceptually  difficult  because  of  its  noncon¬ 
stant-sum  nature. 

A  constant-sum  relative  of  {SD  )  is  derived  next. 

o 

Let  ^  and  '  ^  (i  =  1,  2)  be  weights  reflecting  the  relative 
importance  of  detection  and  evasion  to  the  patroller  (i  1)  and 
the  transitor  (i  =  2).  Then  it  is  reasonable  to  represent  the 
payoffs  to  the  patroller  and  the  transitor  in  the  following  way: 
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patroller  to  maximize  f^P{T  <  D  <  S)  +  X^[l  -  P(S  <  D  <  T] 
transitor  to  maximize  <  D  <  T)  -  P(T  <  D  <S] 

which,  in  case  =  X2  and  fX^  =|j.2  *  *s  equivalent  to  the 
zero-sum  game: 

(SD.)  min  max  fciP(T  -  D  <  S)  -  \P(S  <  D  <  T)] 
sT  Sp 

where  and  2p  are  respectively  the  patroller’s  and  the 
transitor' s  strategy  spaces.  ' 

It  is  interesting  and  helpful  to  note  the  effect  of  assuming 
a  priori,  that  one  of  the  two  probabilities  in  the  objective  func¬ 
tion  of  (SDj)  must  be  zero.  This  assumption  would  be  natu¬ 
ral  if,  for  instance,  the  inherent  differences  between  the 
patroller's  and  the  transitor' s  equipment  made  it  virtually 
impossible  to  achieve  either  S  >  T  or  T  >  S  at  the  same 
time  as  fulfilling  other  requirements  of  the  mission.  Suppose 
then  that  P(T  <  D  £  S)  =  0  a  priori.  This  implies,  since 
P(D  <  S)  =  P(D  <  S,  D  <  T)  +  P(T  <  D  <  S)  and 
P(D  <  T)  =  P(D  <  S,  D  <  T)  +  P(S  <  D  <  T), 

that 

P(D  <  T)  -  P(D<S)  =  P(S  <  D  <  T); 
and  therefore  that  the  game  (SDj)  is  equivalent  to: 


1 


f 


-53- 


(SDJ  min 

2t 


max  [P(D  £  S/ 

V* 

“P 


P'D 


T)1 


The  same  conclusion  is  reac^  if  we  start  wit;-  >,he  *  priori 
assumption  that  P(S  <  D  T)  -  r  . 

The  game  (SDj)  vill  be  refen  co  herein  r  ■-  t‘  e 
"Difference  Game",  5  .  ■?rr\>  as  the  sumptions  or.  ^!n,n 
(SD2)  is  based  are  urmoce*  ,1  •?'  '»e.  and  toe 

mathematical  formui  . of  u.  analytically  too  ,om- 

plex,  we  shall  analyze  a  I’.hjJ-.L)  'U^fe-ont  tfcco'ir  /'n.  of 
(SDj) : 

(SDJ  min  max  [|jlP(D  <S  M  (D  <  T)1  . 

et  sp 

Substitution  of  parametric  forms  of  the  probabilities  in  (SD^) 
in  terms  of  the  strategy  variables  u  and  v  (the  normalized 
speeds)  results  in  the  payoff  function: 

G(u,  v)  =pLFP(u,v)  +A,(1  -  FT(u»v))  ;fi>  0  ,  A>  0  , 


The  patroller  is  the  u-player  and  maximizer;  the  transitor  is 
the  v-player  and  minimizer.  The  analysis  can  be  conducted 
without  specifying  the  values  of  the  utility  "weights",  M-and\  . 

Without  loss  of  generality,  since  [i>  0  and  both  H. 
andA.are  held  constant,  we  can  divide  through  by  f*  and 
represent  the  game  as: 


G  (u,  v)  = 


FP{u,  v) 


sFT{c,  v) 


=  (ecv~u  -  .e^’^l+uV 


where  s  =  ^  >  0  but  may  be  indefinitely  large.  In  thie 
game,  the  patroller  is  maximizing  and  the  transitor  is  mini¬ 
mizing  a  weighted  combination  of  the  probability  of  the  patroller 
detecting  the  transitor  and  the  probability  of  the  patroller 
avoiding  counter-detection  by  the  transitor. 

Both  F  (u,  v)  and  F  (u,  v)  are  convex  in  the  mini¬ 
mizing  player's  variable:  v  for  the  former,  u  for  the  latter. 
The  proof  is  easily  found  by  examination  of  the  first  or  second 
derivatives  of  the  functions.  Unfortunately,  the  convexity  of 

g 

G  is  not  so  easily  explored.  In  a  related  secure-detection 
game,  examination  of  the  numerical  evaluation  of  the  payoff 
function  (Table  III.  2.  1  in  this  report)  shows  that  optimal  pure 
strategies  exist:  they  are  the  slow  speeds  uq  and  vq  for 
the  patroller  and  the  transitor  respectively.  Under  a  mild 

restriction  on  three  parameters  of  the  game  (u  ,  c  and  s),we 

\ 

will  prove  that  the  same  strategies  are  minimax  for  (SD^). 


Lemma  HI.  3.1  If  v  =  v  is  any  fixed  number  in  Tv  ,v  1, 
-  1  i  o  mJ 

then  max  C-S(u,  v*)  =  GS(u  ,v  )  when  s  >  1  . 

o 

u.  <u<u_ 
o - m 


The  numerical  calculations  were  performed  for  detec¬ 
tion  distributions  which  are  exponential  with  range,  and  statis¬ 
tically  independent. 
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Ifj,,  iiu  WJJPJWfp  w'^/vv:  ki^  s*-ii£vL-*-i  U^s«fr!4si;  ^ULi^-Arf-1 t!4)>  g  frs:K 


-*'>•*•*.  ~~**-  *■  — 


Proof: 


G^(u,v  )  _.  0^j>1  +  sbev  TT/^' 


s  >  1  =£►!  •*•  <?beV  >  1  +  sb  > 


b+1  „  b+1 

t+7  ~  ivm 


s  >  l=£Gs(u  ,v*)  >  Gs(u,  v*)  for  all  u  >  u 


Lemma  III.  3.  2 


u 

If  ~  > 

c 


-/n(cs) 
iTc  ’ 


then 


niin  jjj  j|c 

VQ<v< v^  G  (u  iv)  =  G*(u  ,vq)  for  any  fixed  u  such 

$ 

that  u  <  u  <  u 
o  —  m 


Proof: 

The  minimum  over  all  ve[v  ,v  1  of  the  function  GS(u  ,  v) 

o  m*  o 

occurs  either  when  v  s  v  or  when  v  =  v  or  at  a  root  of  the 

o  m 

equation  G^u^vJsO.  Rearranging  terms  and  cancelling  non¬ 
zero  common  factors,  the  latter  equation  can  be  represented 
as 


1  +  v  + 


3/  2 
v  /uo  = 


1  +  c 


:ri+se^c+iHuo/c"v)' 

•  - 


Examination  of  the  two  functions  of  v  on  the  left-hand  side  and 

the  right-hand  side  of  the  last  equation  shows  that  it  either  has 

exactly  one  real  root  or  none  in  the  interval  {v  ,v  ),  and  that 

o  m 

the  condition  of  the  Lemma  guarantees  there  will  be  none. 

Furthermore,  under  the  condition,  G?{u  ,  v)  >  0  for  all 

«  o 
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g 

v  in  the  permitted  range,  so  that  G  (uq,  v)  takes  its  minimum 

at  the  lower  er  d  ooint  of  the  range:  v  ~  v 
r  ^  mm  o 

*  211,3.3  me  conditions  —  > — snd  s  >  1 

-  e  1+c  — 

g 

are  sufficient  for  (u  ,  v  )  1°  he  a  saddle  point  of  G  (u,  v)  . 

For  all  (u,  v)  such  that  u  <  u  <  u  ,  v  <v<v  , 

o  mo  —  —  m 

G(u,  v  )  <  (G(un,  v  J  <  G(u  .  v).  When  c  >  1,  the  first  con - 

O  OO  G 

dition  is  unnecessary,  as  it  is  implied  by  the  second. 

Proof:  This  is  a  direct  consequence  of  Lemmas  III.  3.1  and 
III.  3.  2.  We  have  now  proved: 

Theorem  The  pure  strategies  u  »  uq  ar*d  v  =  VQ  are  op¬ 
timal  for  the  game  G3,  when  s  is  sufficiently  large  (and  in 
any  case  not  less  than  1). 

As  an  interesting  special  case,  suppose  that  the  patrol¬ 
ler  and  the  transitor,  being  matched  in  equipment,  have  the 
same  noise  functions  so  that  c  =  1.  Then  the  theorem's  con¬ 
clusion  provides  a  recommendation  that  if  s  =  1  both  sub¬ 
marines  should  travel  at  their  minimal  speeds*  (u  ,  v  )  .  Now 

r  o  o 

suppose  that  the  transitor  has  equipment  that  is  twice  as  quiet 
as  the  patroller's:  c  =  0.5.  Then  the  theorem  is  only  opera¬ 
tive  when  s  >  2;  but  the  pure  strategy  solution  (uq,  vq)  may 

$ 

(in  terms  of  the  noise  functions  this  :.s  the  speed  such 
that  any  slower  speed  is  likely  to  produce  essentially  the  same 
noise  output). 


be  true  for  e 


<  *.  b.3  well.  The  fcraiioitar'  a  advantage  of 
o.-;  -•'•-o'  -  ■  -.I.,*,.  nag  *  greater  chance  to  detect  the 

patroller  before  being  detected.  As  a.  result,  we  can  only 
recommend  the  strategy  (uQ,  v  )  If  the  patroller  values  avoid¬ 
ing  counter-detection  twice  as  highly  as  a  detection  of  a  transi- 

tor,  or  believes  it  twice  as  likely  that  the  transitor  will  play 

T 

the  Transitor1  s  Speed  Game  G  as  that  he  will  play  the 

p 

Patroller's  Speed  Game  G  . 

The  pure  strategy  solution  (uq,  vq)  of  the  Difference 
Games  GS  is  not  in  agreement  with  most  of  the  solutions 
found  for  the  examples  in  [5]  for  the  Patroller's  Speed  Game. 
We  show  in  the  table  below  a  comparison  of  the  solutions  in  the 
case  of  eleven  examples  presented  in  [5  ].  The  last  column 
of  the  table  indicates  the  probability  that  the  solutions  offered 
in  [5  ]  violate  the  non-probabilistic  condition  (cv  >  u)  for  a 
secure  detection  by  the  patroller. 

These  examples  make  it  clear  that,  while  the  simpli¬ 
fied  speed  model  for  secure  detection  of  the  present  section 
cannot  guarantee  that  detection  will  be  secure  in  cases  where 
the  patroller  must  u a  higher  speed  than  the  transitor  (ex¬ 
amples  1,  2, 6, 7,  8,  9),  it  can  provide  a  better  guide  to  strategy 
in  other  cases  (4  and  6).  For  the  remaining  examples,  the 
comparison  shows  interesting  differences  (except  10)  which 
are  not  yat  fully  explained. 


1. 

II 

O 

(.50, 

.  vb) 

(.8,  .4) 

50% 

2. 

C=1 

«.  75, 

.25) 

(.7500,  .6054) 

100% 

3. 

c=2 

(.00, 

.02) 

(.2000,  .2229) 

0 

4. 

c-1 

Mr 

.25) 

A.  1500  wp  .3972  45qm 
VX  69.81  wp  .  6028"  459A' 

100% 

5. 

c=l 

(.00, 

.02) 

/jr.  0000  wp  .1350 
^\.2000  wp  ,8650"2852J 

0 

6. 

C=1 

(.10, 

.05) 

(.90,  .30) 

100% 

7. 

C=1 

(.70, 

.  30) 

(.70,  .55) 

100% 

8. 

c=l 

(.05, 

.02) 

(.65,  .30) 

100% 

9. 

c=2 

(.50, 

.45) 

(.7179,  .4500) 

0 

0. 

C=1 

(.50, 

.55) 

(.50,  .55) 

0 

1. 

c=2 

(.40, 

.40) 

(.70,  .40) 

0 

*  wp  =  with  probability 


SB 


HI.  4  Large  Structured  Matrix  Games 

Introduction 

We  consider  classes  of  large  matrix  games  in  study¬ 
ing  either  game-theoretic  aspects  cf  barriers,  i.  e. ,  an  array 
of  specified  zones  in  which  patrolling  submarines  attempt  to 
detect  transitors,  or  aspects  of  the  patrol  strategies  used  by 
a  patroller  within  a  zone.  In  these  matrices,  entries  corres¬ 
pond  to  (are  proportional  to)  the  probability  of  th 2  patroller 
detecting  the  transitor,  columns  representing  a  choice  of 
transit  lane  by  the  transitor,  and  rows  the  choice  of  a  "pure 
patrol  strategy"  by  the  patroller.  The  motivation  for  consi¬ 
dering  matrix  games  should  be  clear ;  computational  methods 
for  solving  them  and  theorems  for  analyzing  them  abound. 
However,  in  order  to  obtain  any  results  of  interest  for  the 
problems  at  hand  large  games  need  to  be  considered.  Analy¬ 
sis  indicates  that  if  large  matrix  games  are  to  be  solved,  either 
they  must  possess  a  strikingly  symmetric  structure  which 
permits  explicit  analytic  solution, or  realistic- looking  matri¬ 
ces  must  be  generated  from  data  obtained  in  fleet  er.ercises 
and  solved  by  techniques  from  linear  programming.  Both  of 
these  approaches  are  discussed  below. 


Toeplitz  Matrix  Games 


We  consider  square  matrix  games  G  =  (g..)  where 
g.j  =  f(|i-jj),  that  is.,  the  entry  g^  is  a  function  of  the 
distance  of  the  entry  {? ,  j)  from  the  main  diagonal.  It  seems 
particularly  appropriate  to  consider  functions  f  which  are 


monotonic  non-increasing  if  we  interpret  pure  row  strategy 
i  as  that  of  hovering  at  station  i  (i  =  1,  .  .  .  ,  n)  .  Im¬ 
plementation  of  successive  row  strategies  i  and  i  +  1  can 
be  made  in  a  straight-line  patrol  between  stations  i  and 
i  +  1  within  a  zone.  A  mixed  strategy  may  then  be  inter¬ 
preted  as  the  proportion  of  time  spent  hovering  in  the  vicinity 
of  i  in  a  back-and-forth  patrol. 

In  previous  work  MATHEMATICA  considered  an 
m(2k  -  l)-station  problem  G  with  g.j  =  f  (|i-j|)  =  1  for 
|  i“j |  <  k  -  1  frnd  g..  =  f<|i-j|)  =  0  for  {i-jj  >  k  -  1 
where  k  >  1  .  This  corresponds  to  the  assumption  that  the 
transitor  is  detected  if  he  traverses  at  any  one  of  the  k  -  1 
adjacent  stations  on  either  side  of  the  patroller’s  station. 

Call  such  games,  which  are  defined  by  specification  of  m 
and  k  ,  H(m,  k)  .  For  example,  the  game  H(2, 2)  is  defined 
by  the  6x6  matrix 
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H(2,  2) 


It  is  convenient  to  think  of  m  as  the  number  of  iden¬ 
tical  blocks  or  square  submatrices  of  dimension  2k  -  1 
which  lie  along  the  main  diagonal  of  H(m,  k).  Using  these 
concepts  it  is  easy  to  show  that  the  value  of  the  game  H{m,  k) 
is  l/m;  an  optimal  strategy  for  the  patroller  or  row  player 
is  to  choose  row  i  with  probability  l/m  ’£  i  =  k  (mod 
k  +  1)  and  with  probability  zero  if  i  jjh.  k  (mod  k  +  1)  ; 
and  an  optimal  strategy  for  the  transitor  or  column  player  is 
to  choose  some  column  from  each  block  and  play  this  column 
with  probability  l/m  .  Thus,  for  example,  the  value  of 
H(2,  2)  is  1/2  ;  the  patroller's  optimal  strategy  is  (0,^-,  0,  0,j,  0); 
and  the  transitor's  optimal  strategy  is  (-y,  0,  0,0,0,-^)  • 

The  unsatisfactory  nature  of  games  H(m,  k)  is  that 
they  make  a  "cookie  cutter"  assumption  for  the  range  law: 
any  transitor  within  k  -  1  stations  of  the  patroller  is  surely 
detected,  any  transitor  further  away  is  never  detected. 
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Graphically, 


Figure  3. 3 


Consider,  now,  an  mk- station  problem  G  with 
g  =  «|i-j[)  =  k  'fail,  for  |i-j|  <  k  -  i  and  gy  = 
£{|i-j|)  =  0  for  ji-j  j  >  k  -  1  where  k  >  1  .  This 
corresponds  to  the  assumption  that  the  transitor  is  detected 
with  probability  1  -  */k  if  he  traverses  in  a  lane  <  -distant 
from  the  patroller1  s  station.  Thus  the  cookie-cutter  range 
law  is  discarded  and,  instead,  the  range  law  is  that  proba¬ 
bility  of  detection  decreases  as  distance  increases. 
Specifically, 
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Probability  of  detection 


jEc’i'w  ~ f<P 


t 


We  call  such  games,  which  are  again  defined  by  spe 
cification  of  m  and  k,  G(m,  k)  .  Again,  it  is  convenient 
to  think  of  m  as  the  number  of  identical  blocks  or  square 
submatrices  cf  dimension  k  which  are  of  the  form 


k 


k-1 

k  k-1 


2 


1 

2 

• 

k 


and  lie  along  the  main  diagonal  of  G(m,  k)  .  For  example, 
G(2, 3)  is  the  6x6  matrix 


Jg}\ 


3  2  1, 


2  3  2  1 


12  3  12 


1  2  3  2  2 


1  2  3  t. 


•12  3 


=  G{2,3) 


We  have  found  that  the  value  and  optimal  strategies 


for  any  game  G(m,  k)  can  be  specified. 

The  value  of  the  game  G{m,  k)"is 


mk+i 


.  An 


optimal  strategy  for  the  patroller  or  row  player  is  to  choose 
row  ik+1  (i  =  0,  ...»  m-1)  with  probability  (m-i)/m(m+l)  ; 
row  ik  (i  =  1,  ...»  m)  with  probability  i/m(m+l)  ;  and 
all  other  rows  with  probability  zero.  Symmetrically,  the 
transitor  or  column  player  is  to  choose  columns  i  with  the 
same  probability  with  which  the  row  player  chooses  rows  i  . 
Thus,  for  example,  the  value  of  the  game  G(2, 3)  is  7/l8 
and  optimal  strategies  for  both  players  are  l/ 6[2, 0, 1, 1, 0, 2]. 

To  prove  this  statement  it  suffices  to  show  that  the 
expected  gains  or  winnings  of  the  row  player  against  any  pure 
strategy  of  the  column  player  is  precisely  the  value  of  the 
game.  Then,  since  the  game  matrix  is  symmetric,  the  iden¬ 
tical  strategy  for  the  column  player  assures  him  of  losing 
the  value  as  well.  So,  consider  any  column,  say  column 

.  Its  non-zero  entries  lie  within  the  rows  2k+j-k  through 
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(k  +  j  +  k  .  Within  these,  the  value  of  the  entry  in  row  ik 
is  k  -  j  ;  in  row  fk  +  1  is  k  -  j  +  J.  ;  in  row  (£+l)k  is  j  ; 
and  in  row  Q+l)  k  +  1  is  j  -  1  .  Thus,  since  the  mixed 
strategy  above  plays  only  these  rows  with  non-zero  probabili¬ 
ties,  the  expected  winnings  against  column  fk  +  j  is 

+  (k“j+1)  +  +  (m-/-l)(j-l)j 

mk.-f  1 
km(m+i) 

thus  proving  the  assertions. 

Notice  that  as,  k  becomes  large,  the  value  of  the 
game  approaches  (m  +  1) }  Further,  the  parameter  k  enters 
only  for  purposes  of  determining  the  cycle  of  non  zero  choi¬ 
ces  of  pure  strategies.  Thus,  as  an  average  figure,  we  ob¬ 
tain  the  approximate  result  that  a  patroller  able  to  detect  a 
transitor  with  probability  1  -  ^  ,  if  x  <  d  where  x  is  the 
distance  between  them,  d  the  maximum  distance  at  which 
detection  is  possible,  has  probability  (m  +  1)  ^of  detecting  a 
transitor,  where  D  is  the  width  of  the  patrol  zone.  In 

J 

G(m,  k)  we  interpret  d  =  k  and  D  =  mk  so  that 
is  (m  +  1)  \  agreeing  with  the  result  above  as  k  becomes 
large  (which  is  essentially  a  change  of  scale).  Thus  it  would 
seem  that  the  "measure  of  effectiveness"  ^-j-  should 
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I 


have  a  bearing  on  the  design  of  barriers,  and,  more  parti¬ 
cularly,  on  the  definition  of  the  width  D  of  a  patrol  zone, 
in  relation  to  the  secure  sweep  width,  which  is  proportional 


to  d  . 


Assume  that  d  is  known  and  that  D  may  be  chosen 


in  such  a  way  as  to  make  D/d  =  m  integer.  The  implica¬ 
tions  of  the  assumed  range  law,  with  its  maximum  range  d 
for  a  detection  to  occur  with  non-zero  probability,  are  then 
as  follows:  When  m  is  1  and  the  range  of  detection  is  the 
whole  width  of  the  barrier  zone,  the  patroller's  mixed  stra¬ 
tegy  assigns  equal  weight  to  each  end  of  the  zone  and  zero 
and  zero  weight  to  the  "center".  When  m  =  2  and  the 
range  of  detection  is  half  the  zone  width,  the  mixed  strategy 
assigns  essentially  equal  weight  to  the  "center"  of  the  zone 
and  each  end  of  the  zone.  At  the  other  extreme,  when  m  is 
very  large  (range  of  detection  is  a  very  small  fraction  of  zone 
width)  the  mixed  strategy  for  patrol  ’.s signs  equal  non-zero 
weight  to  pairs  of  adjacent  interior  stations  spaced  out  d 
units  apart,  and  double  weight  to  each  end  station.  For  inter¬ 
mediate  values  of  m  the  results  are  qualitatively  the  same, 
except  that  the  optimality  is  more  sensitive  to  the  correct  posi¬ 
tioning  of  the  individual  stations.  The  main  point  to  realize 
about  the  nature  of  these  mixed  strategies  is  that  they  deci¬ 
sively  do  not  recommend  a  uniform  patrol  back  and  forth 
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.v 


across  the  zone.  Instead,  the  designation  of  a  particular  set 
of  m+1  rows  out  of  the  mk  rows  in  the  game  matrix  for  use 
in  the  patrol  strategy  is  equivalent  to  a  recommendation  of  a 
number  of  favored  positions  for  optimal  patrol  in  the  harrier. 
Of  course  the  recommendation  is  contingent  upon  the  particu¬ 
lar  form  of  the  range  law  assumed.  Other  range  laws  will 
produce  different  mixed  strategies;  in  general  it  is  reasonable 
to  expect  that  they  will  also  show  a  pattern  which  favors  cer¬ 
tain  distinguished  points  of  the  zonal  width. 

Large  Matrix  Games 

A  difficulty  with  many  of  the  analytic  models  discussed 
previously  is  that  the  strategy  spaces  of  both  the  patroller 
and  the  transitor  are  considerably  too  restrictive  to  adequate¬ 
ly  represent  actual  strategy  alternatives.  A  methodological 
approach  which  would  allow  at  least  some  analytic  probing 
of  tradeoffs  between  such  factors  as  speed,  use  of  active 
sonar,  patrol  patterns,  etc. ,  is  described  below.  AL'ow  the 
transitor 8  pure  strategies  which  depend  upon  the  choice  of 
transit  lane  and  choice  of  speed;  allow  the  patrollers  pure 
strategies  which  depend  upon  varying  speeds,  type  of  patrol 
(e.  g. ,  bow-tie,  back  and  forth);  but  in  all  cases  assume  that 
the  set  of  choices  within  each  category  is  finite.  For  every 


I 


0 


choice,  which  can  be  represented  by  a  t-tuple  of  integers  for 
the  transitor  and  a  p-tuple  of  integers  for  the  patroller,  find, 
according  to  given  functions  or  from  data  produced  by  fleet 
exercises,  probabilities  of  detection,  or  of  first  detection,  or 
of  no  detection,  etc. ,  by  the  patroller  and  the  transitor.  On 
the  basis  of  these,  as  sign  subjective  values  to  the  patroller  for 
mutually  exclusive  outcomes.  This  results  in  a  matrix  having 
as  many  rows  as  the  product  of  the  number  of  speeds,  patrols, 
etc. ,  available  to  the  patroller,  and  as  many  columns  as  the 
product  of  the  number  of  transit  lanes,  speeds,  etc. ,  avail¬ 
able  to  the  transitor.  This  matrix  game  could  then  be  solved 
by  means  of  linear  programming  to  determine  optimal  mixed 
strategy  patrols  for  the  patroller  and  transits  for  the  transi¬ 
tor.  The  difficulties  with  this  approach  lie  in  two  areas: 
availability  of  data  concerning  probabilities  of  detection  as  a 
function  of  speeds,  patrol  patterns,  etc. ,  and  the  size  of  the 
resulting  matrix.  Nevertheless,  extremely  useful  insights 
could  be  obtained. 
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The  size  of  the  resulting  problem  may  appear  to  be 
too  large.  Current  computer  limitations  would  allow  at  most 
3ome  1000  pure  strategies  for  one  player,  though,  a  practi¬ 
cally  unlimited  number  for  the  other.  The  hope,  of  course, 
is  that  some  form  of  decomposition  approach  might  lend  itself 
to  permit  efficient  computation  of  large  problems.  Unfor¬ 
tunately  it  seems  that  the  structure  of  most  matrix  games 
does  not  allow  such  an  approach.  The  basic  reason  for  this 
can  be  summarized  by  saying  that  the  value  of  the  game  is  a 
complicated  rational  function  of  the  values  of  the  matrix 
entries.  Thu  simplest  applicable  type  of  decomposition 
is  for  matrices  which  are  the  tensor  product  of  other  matri¬ 
ces.  This  type  of  approach  was  investigated  in  ref  [7]  through 
rather  involved  and  lengthy  arguments.  We  present  those 
ideas  in  a  direct  and  simple  manner  here. 

Let  A  be  a  matrix  game,  x  a  mixed  strategy  on 
rows,  y  a  mixed  strategy  on  columns.  Then,  by  definition, 
x  and  y  are  optimal  and  A  is  the  value  of  the  game  A 
if  and  only  if 

x*A  >»  \e  ,  and  Ay*  <  Xe  . 

where  e  is  a  vector  of  l's  (one't-)  of  appropriate  dimen¬ 
sion.  We  have  the  following  result: 


-70- 


Let  x  and  y  be  optimal  for  A  with  value 
X  >  0,  and  x  ,  y  be  optimal  for  A  with 
value  X  >  0.  Assume  Ay  >  0  and 
Ay  ^  0  (it  is  sufficient  to  have  A,  A  >0 
for  this  assumption  to  hold).  Then  x(^x 
and  y^?)y  are  optimal  for  the  game  A0A  , 
the  tensor  product  of  A  and  A  ,  with  value 


The  proof  of  this  is  trivial, for 

(xg)x)  (A£g)A)  =  xA(3)xA"  >  XX  e  and 

(A®A)  (ygy)  =  Ay$A  y  <  AX  e  . 

the  last  irjgqfcality  holding  by  the  explicit  assumption  made  in 
the  statement  above. 

As  an  example  consider 


Then  x  =  (3/5,  2/5)  ,  y  =  (3/5  ,  2/5)  ,  X  =  1/5  ;  and 
x  =  (1/16  ,  4/16  ,  11/16)  ,  7  =  (6/16  ,  2/16  ,  8/16)  , 
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X  =  18/16  ;  and  the  assumption  Ay  i  0  ,  Ay  >  0  i 
satisfied.  Therefore,  the  optimal  solution  to 


is 


A  0  A  = 


is 


with  value 


x  0  x  =  -gg-  (3,12,33,2,8,22), 


y  ®  y  =  -  (18,6,24,12,4,16), 
80 


\\  18 
AA  =  M 


This  result  is  applicable  to  situations  where  the  effects 
of  two  parameters  are  multiplicative.  For  example,  if  the 
effect  of  patrol  pattern  on  detection-probability  was  indepen¬ 
dent  of  the  effect  ox  speed,  the  payoff-matrix  Gpg  obtained 
when  both  parameters  were  taken  as  strategic  variables  would 


-72- 


be  proportional  to  the  tensor  product  ®  Gg  of  the  pay¬ 
off  matrix  G^  of  the  "pattern  game"  with  the  payoff  matrix 

G  of  the  "speed  game", 
s 

True  independence  of  such  a  pair  of  parameters,  of 
course,  would  be  a  rare  circumstance  --  but  if  the  factors 
were  nearly  independent,  the  above  tensor-product  theorem 
would  provide  a  useful  first  approximation  to  a  solution. 

If  the  strategic  choices  for  each  player  involved  more 
than  two  parameters,  we  could  employ  the  obvious  generali¬ 
zation  of  the  above  theorem  to  a  tensor  product  of  n  matri¬ 
ces. 

Theorem:  For  each  a  =  1,  ....  n,  suppose  Aa  is  the 
matrix  of  a  game  which  has  optimal  strategies  xtt  and  ya  , 
and  value  \a  >  0  .  If  also  Aaya  >  0  for  each  a  ,  then 
the  game  whose  matrix  is  the  n-fold  tensor  product  A  =  A* 

0  A2  0  . .  .0  An  has  value  A  =  ^2  . .  .  \n  and  opti- 

12  n  1 

mal  strategies  x  =  x  ®  x  0  . .  0  x  and  y  =  y  0 

y2  0  . . .  0  y11. 
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in.  5  Continuous  Analogs  of  Toeplitz  Matrix  Games 


In  Part  III.  4  "Large  Structured  Matrix  Games,"  a 
class  of  large  matrix  games  G(m,k)  was  introduced  in 
which  the  transitor  has  as  pure  strategies  the  choice  of  one 
of  a  finite  number  of  transit  lanes  (choice  of  a  column  in 
G(m,  k))  and  the  patroller  has  as  pure  strategies  the  choice 
of  one  of  a  finite  number  of  stations  at  which  he  can  hover. 
Then,  the  probability  that  the  patroller  detects  the  transitor 
is  (k  -  d)/k  if  d  <  k  and  0  if  d  >  k  ,  where  d  is  the 
distance  between  the  patroller  and  the  transitor. 

It  is  natural  to  consider  the  continuous  analog  of  this 
class  of  matrix  games.  This  game  is  defined  as  follows:  . 
Let 

^<x.y)  =  1  -  if  |x-y|  <  d 

=  0  if  |x  -  y|>  d 


where  0  <  x,  y  <£  D  ,  be  the  payoff  of  player  II,  the  y- 
player,  to  player  I,  the  x-player.  The  interpretation  for 
this  game  is:  the  patroller  has  as  pure  strategies  the  choice 
of  a  point  x  on  the  line  [0,D],  the  transitor  has  as  pure 
strategies  the  choice  of  a  point  y  on  the  line  [0,D]  ;  the 


probability  of  the  patroller  detecting  the  transitor  is  given 
by  ^{x,  y)  .  Thus  d  is  the  maximum  range  in  which  detec¬ 
tion  is  possible;  D  is  the  width  of  the  patrol  zone.  For 
fixed  x  ,  say  x  =  x  ,  ^(x,  y)  describes  the  range  law; 


$>  (x,  y) 


Figure  3.  5 


It  is  most  surprising  to  find  that  both  players  have 
optimal  mixed  strategies  which  each  use  only  a  finite  num¬ 
ber  of  pure  strategies.  A  mixed  strategy  is  a  probability 
distribution  f  on  the  choice  point  x  for  player  I,  and  a  prob¬ 
ability  distribution  g  on  the  choice  of  point  y  for  player 


The  value  of  the  game  defined  above,  for  D/d  not 


integer -valued,  is 


2n  -  D/d 
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where  n  =  [D/d]  +  1  ,  with  [.]  denoting  “integer  part  of. " 

Note  that  n  D/d  so  that  the  value  of  the  game  is  about 
d 

gyy  ,  as  was  indicated  in  an  approximate  manner  in  Part 

III.  4  .  ,vn  optimal  strategy  for  player  I  is  to  choose  a  distri¬ 
ct 

bution  f  (x)  defined  as  follows: 


f*(x)  =  f*  (x)  +  f*  (x) 


where 


fL(x)  =  fR(x)  =  0  if  x  /  kd  j  x  /  D  -  kd  f 


k  integer  , 

£I<kd>  *  JTpTTT  *  £>  '  kd>  - 

where,  again,  n  =  [D/d]  +  1  .  An  optimal  strategy  for 

*  $ 

player  II  is  to  choose  a  distribution  g  (y)  where  g  (y)  = 
g*(y)  +  g£(y)  ,  with  g*  (y)  =  f* (y)  ,  g^(y)  =  fR(y)  . 

For  example,  if  d  =  5  ,  D  =  16  we  have  n  = 
[16/5]  +1  =  4;  the  value  of  the  game  is  24/100;  and 
each  player  uses  8  pure  strategies 

f>  =  4s  =  4<16)  *  4<5>  = 
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4<io>  *  is  -  4<6)  ;  4fl5>  =  w  =  4<*>  • 

*  * 

g  is  the  same  as  f  .  Graphically  this  means  that  the 
patroller  should  circle  at  one  of  8  stations  as  indicated  in 
Figure  3.  6  below. 


2  3 

JU  JU 


10  11 


1  4 

"ZIT  JU 


15  D=16 


Let  E(  ;  f,  g)  denote  the  expected  winnings  of 
player  I  given  that  1  uses  mixed  strategy  f  and  II  uses 
mixed  strategy  g  .  To  prove  the  contention  above  it  suffices 
to  show 

E(  ;  i,  g*)  <  E(^;f*,g*)  <  E(  pt;  f*,g)  , 
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♦  ♦  * 
that  is,  f  is  optimal  against  g  ,  and  g  is  optimal  against 

£*. 


We  first  compute  ;  f,  g*) 


D-kd 


+  nlnVlT  JQ  (n  *  k)U  f  (d-D  +  kd  +  x)df 


D-kd-d 


D-kd+d 


i  f 


(d  -  kd  +  D  -  x)  df 


D-kd 


since  fi(x,y)  =  0  for  |x-y  |  >  d  ,  where  we  hive  ''broken 
up"  the  interval  [0,D]  into  [0,  D-nd+d,  d]  ,  [D-nd+d,  d], 

[d,  D-nd+2d],  etc.  ;  or,  in  general  into  [hd-d,  D-nd+hd],  [D-nd+hd,  hd]. 
Consider,  first,  the  contribution  to  E(  f\  f,  g  )  due  to  the 
interval  [hd-d,  D-nd+hd]  .  From  the  first  sum,  with  j  = 
h-1  and  j  =  h  we  get 


|  (d-xthd-d)df  and  f  (x+d-hd)df 

t 

while  from  the  second  sum  with  k  =  n-h  and  k  =  n-h+1 
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and 


■Ir-'Tiya'  j  <d  -  D  +  (n  -  h)  d  -  x) 


‘■fr  ”  lid  ~  -  /  (d-(n-h+l)d  +  D-x) 


Combining  these  results  we  obtain 


r  jZn  -  p/d) 

J  n(n  +  1) 


Precisely  the  same  result  obtains  when  integration  is  per¬ 
formed  over  [D  -  nd  +  hd,  hd]  .  Thus  we  find 


*,  (2n  -  D/d) 


2n  -  D/d 


This  says  that  against  the  strategy  g  player  I  can  choose 

any  mixed  strategy  and  obtain  the  vailue  (2n  -  D/d)/n(n  +  1)  . 

Thus,  in  particular,  f  =  f  maximizes  E(  ;  f ,  g  )  . 

#  /  # 

Similarly,  g  =  g  minimizes  E(0;f  ,  g)  .  Therefore, 

*  # 

f  ,  g  are  optimal  strategies  and  the  value  of  the  game  is 
(2n  -  D/d)/n(n  +  1)  . 

For  the  D/d  integer  valued  it  is  easy  to  verify  that 


the  value  of  the  game  is  (n  +  1)  ^where  n  =  D/d  ,  and 
both  players  play  points  kd  (k  =  0,...,n)  with  probability 
(n+lf 1  . 

It  seems  clear,  as  is  the  case  for  the  optimal  strate- 

$  $ 

gies  for  the  games  G(m,  k),  that  f  and  g  are  unique. 

So  we  obtain  similar  results,  about  the  same  "measure  of 


effectiveness" 


-  f  and,  again,  a  recommendation  that 


the  patrollerc  should  use  a  finite  number  of  favored  stations 
or  positions  for  specified  proportions  of  time.  Of  course, 
the  result  depends  upon  the  particular  range  law  ^(x,y) 
which  is  chosen.  Nonetheless,  reasonable  range  laws  would 
seem  to  lead  to  the  same  type  of  qualitative  result;  certain 
distinguished  stations  should  be  used. 

The  game  j£(x,  y)  which  has  been  analyzed  above  is 
the  continuous  analog  of  the  matrix  games  G(m,  k)  .  For 
pmrjoses  of  comparison  it  is  interesting  to  consider  the  con¬ 
tinuous  analog  of  the  matrix  games  H(m,  k)  (see  Part  III.  4). 
It  is  natural  to  formulate  this  game  as  follows.  Let 


Y(x,y)  =  1  if  X  -  y  <  d 


=  0  if  x-y>d  , 


where  0<x,y  <D,  be  the  "payoff'  of  player  II,  the  y-player, 
to  player  I,  the  x-player.  Here,  again,  D  i  s  the  width  of 


the  patrol  zone,  and  d  is  the  maximum  range  in  which  sure 
detection  is  possible  with  probability  ^  =  1  .  Thus,  we 
have  the  cookie-cutter  range  law:  for  fixed  x  ,  say  x  «  x 
'^/"(x,y)  describes  the  range  law 


The  value  of  the  game  ,  for  D/2d  not  integer 
valued,  is 

_1_ 

m 

where  m  =  [D/2d]  +  1  .  Note  that  m  D/2d  so  that  the 

"measure  of  effectiveness"  is  about  VD  .  An  optimal  stra^ 

$ 

tegy  for  player  I  is  to  choose  a  distribution  f  (x)  for  which 


."  **  '*•  »■***■>-▼**«»*  •■..?>  *v  * 


f  ((2k  +  1)  d)  =  ~  ,  k  =  0,  ... ,  m  -  2  ,(k  =  0,  m-2), 

f*(x*)  =  ~  where  x*  =  min  ^  (2  m  -  1)  d,  , 

$ 

f  (x)  =  0  otherwise; 

and  an  optimal  strategy  for  player  II  is  to  choose  a  distribu- 
* 

tion  g  (y) 

g*(k(2d  +  £  »  =  ±  ,  k  =  0,  .  . . ,  -  1  .  (k  ■  0. . . ,  m-1) 

4* 

g  (y)  =  0  otherwise  . 

Here  0  <  e  <  (D/2d  -  m  +  l)2d/  (m-1).  The  basic 
point  of  the  transitor's  strategy  i 8  that  m  points  y  should 
be  used, but  that  each  pair  of  such  points  should  be  distant 
more  than  2d  . 

For  example,  if  d  =  5  ,  D  =  16  ,  then  m  =  2  ; 
the  value  of  the  game  is  ^  and  each  player  uses  2  pure  stra¬ 
tegies;  *  may  be  between  0  and  6  (we  choose  0.  5): 

f*(5)  =  f*(15)  =  \  ,  g*(0)  =  g*(10. 5)  =  ^  • 

The  proof  of  this  statement  is  straightforward. 
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E(Y;f,g  )  = 


_1_ 

m 


/ 


df  + 


_L 

m 


m=l 

2 

k=l 


(2k+l)d+k  t 


/ 


df 


(2k-l)d+kt 


Thus,,  an  f  which  maximizes  is  any  distribution  f  which 
has  all  its  weight  concentrated  in  the  intervals  specified  in 
the  above  definite  integrals,  f  =  f  is  such  a  distribution. 
So,  EC'J'';  f,  g*)  <  E0£';  f*.  g*)  .  The  inequality  EpJ'';  f*,  g*) 
<  E('^;f  ,  g)  is  immediate. 

If,  on  the  other  hand,  D/2d  is  integer  valued  it  is 
easy  to  see  that  the  value  of  the  game  is  l/m  ,  with  m  = 
D/2d  ,  player  I  plays  points  (2k  +  l)d  ,  k=0,  ...»  m-1, 
with  probability  ,  player  II  plays  points  k(2d  +  *  )  , 

K  =  0,  ....  m  -  1  ,  with  probability  ^ ,  where  0  <  «  <: 
2d/m  -  1  . 

This  singles  out  an  interesting  observation.  Namely, 

if  one  believes  that  the  range  law  is  of  cookie-cutter  type, 

then  the  width  of  patrol  zone  D  should  be  chosen  so  that 

D/2d  is  integer  valued.  For  if  it  is  not  so  chosen  then  the 

patrol  width  might  just  as  well  be  increased:  this  can  lead  to 

no  decrease  in  the  probability  of  detection.  This  seemingly 

peculiar  result  is  a  consequence  of  che  discontinuous  nature 

of  "^(x,  y)  .  Another  way  of  pointing  to  this  peculiarity  is  to 

note  that  if  D/2d  =  m  is  integer-valued  then  the  value  of  the 

game  is  ~  .while  if  the  width  is  increased  to  D  +  6  ,  with 
m 

6  >  0  but  arbitrarily  small,  then  the  value  of  the  game 
jumps  down  to  l/(m  +  1)  . 


Finally,  notice  that  the  measure  of  effectiveness  for 
^  with  defining  parameters  d,D  is  about  2d/D  while  that 
for  ^  with  defining  parameters  d,  D  is  about  .  In 

terms  of  the  extended  definition  ot  given  in  Part  II  this 
means  that  the  cookie-cuttei  range  law  yields  an  SSW  of  2d 

while  that  of  the  "linearly  dec* easing”  »ange  law  yields  an 
cctir  dD 


III.  6  Repeated  Games 


Recent  research  in  the  mathematical  theory  of  games 
by  J.  Harsanyi  [3]  ,  and  R.  Aumann  and  M.  Maschler  [1], 

[2]  has  resulted  in  a  theory  which  provides  strategic  analysis 
for  a  sequence  of  repeated  conflict  sitiations  in  which  the 
players  --  the  patroller  and  the  transitor,  for  instance  -- 
have  incomplete  information  about  the  payoffs.  This  section 
represents  an  attempt  to  apply  some  of  their  results  to  ASW, 
and  to  interpret  them  in  the  context  of  a  barrier  situation. 

Repeated  Zero-sum  Games  of  Incomplete  Information 
are  formal  models  of  a  sequence  of  closely  related  strategic 
situations.  Two  players  are  to  make  choices  of  strategy  at 
each  stage  of  an  indefinitely  long  sequence  of  matrix  games. 
One  or  both  of  the  players  may  lack  certain  information 
which  would  specify  the  true  payoffs  at  each  stage.  The  al¬ 
ternative  matrix  games  which  determine  the  actual  sequence 


being  played  are  known  to  both:  call  them  G.,  G,»  ...»  G  i 
where  G^  is  an  m  x  n  matrix  game  for  each  i  .  In  addi¬ 


tion,  a  prior  probability  distribution  over  the  alternatives, 


(q^  *  ^2  »  •  •  •  *  ^q,)  representing  the  state  of  information 
of  the  uninformed  player  is  known  to  both  players. 


See  Bibliography  on  page  109  of  this  report. 

Assuming  for  the  purposes  of  discussion  that  only 
one  player  has  incomplete  information. 


Naturally  the  sum  of  the  q^'s  is  1;  in  the  case  where  one  of 
the  q.'s  is  exactly  equal  to  1  the  inforn  ^tion  of  both  players 
is  complete.  For  all  other  cases  there  is  said  to  be  incom¬ 
plete  information. 

In  the  sequence  of  repeated  games,  each  player  at¬ 
tempts  to  maximize  his  gain  (minimize  his  loss)  in  the  sense 
of  the  long-run  average  payoff  per  stage.  The  players  are 
generally  not  able  to  learn  exactly  what  the  payoffs  are  at 
each  stage,  but  if  one  player  has  complete  information  he, 
of  course,  can  deduce  them.  The  uninformed  player  must 
try  to  discover  the  payoffs  by  observing  the  strategy  choices 
of  the  opponent,  and  he  may  in  some  cases  approach  a  state 
of  complete  information.  There  are  also  examples  where 
no  information  is  revealed  to  him  by  an  intelligent  opponent' s 
choices.  In  the  latter  cases  the  players  are  simply  playing 
the  game  (one  might  call  it  the  expected  or  averaged  game) 
whose  matrix  is  given  by 

G  =  qjG.  , 

and  the  optimal  strategies  for  both  players  are  determined 
by  the  usual  minimax  solution  to  the  matrix  game  defined  by 

s. 

The  analysis  of  the  repeated  games  with  incomplete 
information  proceeds  as  if  the  first  move  were  a  random 


choice  (by  nature)  of  one  of  the  matrix  games  Gj,G2»  .  • .  , 


G^  .  The  informed  player  is,  in  effect,  told  the  outcome  of 
the  "choice  of  chance"  while  the  uninformed  player  is  not. 

One  of  the  questions  which  the  theory  has  sought  to  answer 
is:  to  what  extent  and  by  which  type  of  strategy  should  the 
informed  player  make  use  of  his  extra  information?  In  a  sin¬ 
gle  stage  of  the  sequence,  the  information,  if  it  is  advanta¬ 
geous,  can  clearly  be  exploited.  However,  it  is  not  clear 
that  the  informed  player  can  continue  to  gain  advantage  from 
his  information  in  the  long  run.  One  of  the  surprising  results 
of  the  work  of  Aumann  andMaschler  is  that,  in  some  situa¬ 
tions,  the  informed  player  must  act  as  if  he  were  uninformed 
if  he  wishes  to  maximize  his  long-run  average  gains.  We 
turn  now  to  the  consideration  of  some  rather  simple  exam¬ 
ples  in  which  these  considerations  may  play  a  r6le  for  the 
patroller  and  the  transitor. 

The  notion  of  games  of  incomplete  information  may 
cast  new  light  on  situations  which  can  arise  in  an  ASW  barrier 
context,  where  the  patroller  (Player  I)  has  certain  infor¬ 
mation  which  the  transitor  does  not  have  --  such  as  the  exis¬ 
tence  of  abnormal  acoustical  situations,  or  the  likely  sonar 
range  under  various  conditions.  The  possibility  exists  that 
the  transitor  (Player  II)  could  infer  information,  which  would 
be  useful  to  him,  by  observing  the  strategy  of  the  patro'ter 
--  i.  e.,  by  observing  how  the  patr^'er  used  his  additional 
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information. 


We  have  therefore  formulated  several  games  of  this 

type. 

Example  1.  The  patroller  knows  detection-range,  the 
transitor  does  not. 

To  be  specific,  we  may  assume  that  the  patroller 
(Player  I)  knows  the  sonar  detection-range,  that  the  transi¬ 
tor  (Player  II)  knows  only  a  probability-distribution  over  the 
sonar  detection-ranges,  and  that  the  transitor  discovers  I's 
patrol  strategy  with  some  time-lag.  For  example,  long¬ 
term  predictions  of  detection-range  will  be  known  to  both, 
actual  ranges  may  change  only  slightly  from  day  to  day,  and 

the  previous  days'  patrolling  orders  may  be  assumed  deci- 
)|( 

phered  and  available  to  the  would-be  transitor. 

Furthermore,  we  assume  that,  when  the  sonar  range 


is  known,  the  detection  game  will  of  the  form  called  G(m,  k) 
in  Chapter  III.  4  of  this  report.  G(m,k)  distinguishes  among 
mk  possible  positions  far  the  patroller,  and  among  mk  cor¬ 
responding  lanes  for  the  transitor.  The  optimal  strategy  for 
Player  I  in  the  game  G(m,  k)  is  given  in  Chapte-*  III.  4  of  this 


report;  the  optimal  strategy  for  Player  II  is  identical;  the 


value  of  the  game  is 


mk+1 

(xn+l)mk 


It  is  usual  to  assume  that  security  and  cryptological 


measures  can  only  delay,  and  not  prevent,  the  enemy's  inter¬ 
pretation  of  a  message.  For  that  reason  we  assume  that  the 
transitor  will  eventually  discover  information  which  the  patrol¬ 
ler  continues  to  use. 
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J 


Now  consider  two  such  games,  defined  as  G{m^,k^) 
and  G{m2,k2),  such  that  =  rr^k^  .  The  matrices 

of  these  games  are  of  the  same  size  --  viz. ,  (m^kp  x 
(rrijkj)  --  and  therefore  a  game  of  incomplete  information 
can  be  defined  with  a  prior  probability  of  playing  the 
game  G(m^ik^)  and  a  prior  probability  q2  of  playing  the 
game  G(m2>k2)  .  An  example  of  the  appearance  of  the 
matrices  with  m^  =  k^  =  3  ,  m^  =  k^  =  2  ,  could  look 
like  this: 


Probability  q^  : 
Gj  =  G(3, 2)  = 


2  1 
l  2  1 
1  2  1 


1  2  1 
1  2  1 


7 

v  =  74 


Probability  (1  -  q^)  =  q^  : 
G2  =  G{2, 3)  =  i. 


3  2  1 

2  3  2  1 

1  2  3  2  1 

1  2  3  2  1 

1  2  3  2  1 

12  3  2 


The  average  matrix,  G  ,  is  then  given  by 
U  =  G1  +  q2.(G2-G1)  . 

To  simplify  the  appearance  of  subsequent  formulae 
we  will  take  x  =  q.,  ,  so  that  q^  =  1  -  x  ,  and  the 
matrix  G  ,  which,  we  shall  denote  by  G(x)  to  recall  its 
dependence  on  x  ,  becomes 


The  value  of  G(0)  is  then  v(G(3,2))  ,  which  is  known 
7  _ 

to  be  24  =  0*2917  ;  the  value  of  G(l)  is  then  v(G(2,  3))  , 

7  ❖ 

which  is  known  to  be  yg-  =  0.  3889  •  Calculation  gives  for 
0  <  >:  i  1  the  value 


v(G(x)) 


189  -  99x  -  45x2  +  llx3 
72  (  9  -  8x  +  xS 


which  is  tabulated  below  as  a  function  of  x  ,  and  is  shown 
in  Figure  3.  8. 


'  too  tedious  to  be  reproduced 
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A,  C  are  intersection  points  of  Cav  v(<T(x))  and  Vex  v(G(x)) 
B  =  Tj,  one-shot  game  with  I  informed 
D  =  r2,  one-shot  game  with  II  informed 


Figure  3. 8 


Value  of  Games  of  Example  1 


v(gW) 


0.0 

7/24 

=  0.2917 

0.  1 

178661/591120 

=  0.3022 

0.2 

2617/8370  - 

=  0.3127 

0.3 

5716/17840 

=  0.3229 

0.4 

17863/53640 

=  0.3330 

0.5 

1037/3024 

=  0.3429 

0.6 

402/1115 

=  0.3526 

0.7 

101423/280080 

=  0.3621 

0.8 

10829/29160 

=  0.3714 

0.9 

2647/6960 

=  0.3803 

1.0 

7/18 

=  0.3889 

When  we  examine  the  convexity  of  this  function  of  x  , 
we  find  that  it  is  a  concave  function;  from  this  fact,  and  from 
the  general  theorems  referred  to  above,  we  conclude  that 
Player  I  (the  patroller),  should  not  make  use  of  his  additional 
information  to  improve  his  strategy  when  the  transitor  could 
observe  the  strategy  --  because  the  transitor  could  then  use 
those  observations  to  deduce  which  stage  game  had  actually 
been  chosen  by  chance  (i.  e. ,  what  the  patroller  had  discov¬ 
ered  about  the  sonar  detection-range). 
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This  result  is,  to  put  it  mildly,  astonishing.  We 
shall  describe  the  optimal  strategies  in  the  games  G(0), 

G^)  ,  G(l)  ,  and  attempt  to  clarify  the  situation. 

Optimal  strategy  in  G(0)  =  G(3,2)  is:  -^(3,  1,  2,  2,  1,  3)  ; 

Optimal  strategy  in  G(l)  =  G(2,  3)  is:  (2,  0,  1,  1,  0,  2)  ; 

Optimal  strategy  in  G(-j)  is:  (’71,  15,  40,  40,  15, 71)  . 

(In  each  case,  the  strategy  cited  is  optimal  for  both  players). 

Now  if  the  game  is,  with  equal  probability,  G(2,  3)  or 

G(3,  2),  but  neither  player  knows  which,  they  must  play 

against  the  expected  outcome  G(-j)  and  the  value  of  the  game 

will  be  0.  3429  .  If  Player  I(the  maximizer)  knows  which  it 

$ 

is,  he  may  profit  by  adjusting  his  strategy  appropriately, 
however,  if  he  does  so  adjust  his  strategy,  and  if  Player  II 
(the  minimizer)  can  observe  his  play  and  deduce  which  game 
is  really  beiig  played,  they  will  find  themselves  in  the  long 
run  playing  either  G(3,  2)  or  G(2,  3)  --  with  values  0.  2917 
and  0.  3889  respectively,  and  with  probability  of  each. 

The  expected  value  of  that  game  to  Player  I  (the  patroller) 

$ 

We  specify  below  the  optimal  strategies  for  that 
one-shot  game. 


-93- 


is  then  2  {0.  2917  +  0.3889)  =  0.  3403. 

It  therefore  profits  the  patroller  to  ignore  the  infor¬ 
mation  he  gets  on  the  detection-range,  under  these  circum¬ 
stances.  If  the  patroller  is  not  concerned  with  his  strategy- 
being  revealed  --  (see  Fig.  3.9)  --  to  the  transitor.the 
fact  that  the  game  might  be  repeated  is  of  no  importance. 
Exact  determination  of  the  optimal  strategy,  and  the  value, 
for  this  one-shot  game  which  we  call  will  tell  us  how 
much  the  patroller  could  benefit  from  knowing  the  true  detec  - 
tion-range  when  the  transitor  does  not  know  it. 

Symmetry  tells  us  that  optimal  strategies  will  involve 
location  in  position  1  as  often  as  in  6,  2  as  often  as  5,  and 
3  as  often  as  4;  we  may  therefore  draw  up  the  matrix  of 
strategies  for  the  "one~shotu  game,  wherein  neither  revela¬ 
tion  of  intelligence  nor  repetition  of  the  same  conflict  are 
significant  factors.  A  strategy  for  Player  I  consists  of  a 
pair  where  r.  =  1,2,  or  3;  r^  =  1  denotes  that 

the  patroller  patrols  in  positions  1  and  6,  r..  =  2  denotes 
positions  2  and  5,  and  r^  =  3  denotes  positions  3  and 
4;  the  first  number,  r  ^  ,  tells  how  player  I  patrols  if  the 
game  is  really  Gj  ,  and  the  second  number,  r^  ,  tells 
how  he  patrols  if  the  game  is  really  G ^  .  The  strategies 
for  Player  II  are  1,2,  or  3,  corresponding  respectively  to 
transits  in  lanes  1  and  6,  lanes  2  and  5,  or  lanes  3  and  4  . 

The  payoff  matrix  for  the  game  is  then  as  shown 
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(we  actually  show  24  to  avoid  fractions): 


0. 3611. 


The  value  of  this  game  is  then  yg-  = 

Although  the  numerical  values  obtained  in  this  one 
particular  case  are  not  in  themselves  of  great  importance, 
they  serve  to  illustrate  several  important  points: 

(1)  if  the  actual  situation  is  either  or  G2  »  with 

equal  probability,  but  neither  player  knows  which,  the  value 
is  simply  v(G(-«r))  because  the  expected  payoff  matrix  is 
simply  ■jGj  +  7  =  G(~r)  *  Value  =  0. 3429  ; 

(2)  if  the  actual  situation  is  either  G^  or  G^  » 

with  equal  probability,  and  the  patroller  knows  which,  but 

the  transitor  does  not,  then  the  patroller  can  patrol  the  end 

lanes  (&1  and  #6)  when  the  game  is  Gj  and  some  mixed 
* 

behavior -strategy  satisfying  (A)  above  when  the  game  is 
G2  ,  then  the  value  is  increased  due  to  his  additional  know-  . 
ledge;  value  =  0.  3611  ; 

(3)  if  the  transitor  is  unable  to  observe  the  detection- 
vange  (i.  e. ,  the  random  choice  of  G^  or  G2)  directly  but  is 
able  to  observe  strategies  of  the  patroller  over  a  long  period, 
he  could  infer  from  the  strategies  as  described  by  (A)  above 
whether  the  game  was  really  G^  or  G2  J  then,  in  the  long 
run,  he  would  be  playing  either  G1  or  G2  (whichever  hap¬ 
pened  to  be  chosen  by  chance)  --  and  they  would  be  equally 
likely,  so  that  the  value  would  be  v(G1)  half  the  time  and 
v(G2)  the  other  half.  The  expected  value  would  then  be 

2  (v(Gj)  +  v(G2))  ;  value  =  0. 3403, 
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From  the  above  numerical  result  we  can  see  that,  in 
this  case,  the  benefit  which  Player  I  could  receive  due  to  an 
advantage  in  intelligence  is  substantial  --  but  that  benefit 
would  be  essentially  nullified  if  the  game  were  played  repeat¬ 
edly  and  the  transitor  could  discover  the  patroller's  past 
strategies.  In  fact,  under  those  circumstances  the  patroller 
should  refrain  from  using  the  information  which  he  possesses. 

Example  2.  The  second  example,  which  is  similar  in 
principle,  results  ir.  answers  which  illustrate  the  opposite 
possibility.  The  example  is  identical  with  the  preceding  ex¬ 
ample,  .but  we  imagine  that  Player  II  (the  minimizer,  who  is 
the  transitor)  can  get  the  additional  information  about  the 
detection -range  (i.  e. ,  about  whether  Gj  or  G 2  was  chosen 

in  the  first  random  move. )  To  Player  II,  the  average  game 
1 

G(y)  with  expected  payment  of  0.  3429,  is  less  desirable  than 
the  opportunity  of  playing  G^  half  the  time  and  G2  the  other 
half  (a  situation  which  has  expected  payment  of  0.  3403). 
Therefore  Player  II  would  benefit  by  using  the  information, 
even  though  he  would  reveal  it  in  using  it.  If,  as  the  opposite 
extreme  case,  Player  II  could  use  the  additional  information 
without  revealing  anything,  the  asymmetrical  one-shot  game 
r2  would  be  played:  the  matrix  of  the  game  F,  is  simply 
the  transpose  of  matrix  shown  above;  the  optimal  stra¬ 
tegy  for  the  well-informed  transitor  is  then 
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£(1.1)  +  ^1.2)  +  |{1.3) 

and  the  optimal  strategy  for  the  uninformed  patroller  is 

7<»  *  S<2>  +  ?(3>  • 

The  value  is  then  =  0.  3263. 

The  results  of  this  example  are  also  shown  in  Figure  3.  8,  p.  91. 

The  above  examples  used  a  special  form  of  the  detec¬ 
tion  range  law  --  viz.  , 

/  p 

-j^(k  -  |i-j|  )  for  i  -  j  £  k  , 

0  if  i  -  j  >  k  . 

This  "triangular"  detection  range  law  results  in  a  discrete 
optimal  strategy  for  each  player,  as  described  in  Chapter  / 

III,  Section  4  of  this  report.  It  also  guarantees  that  the  op¬ 
timal  defense  will  be  equal-gamma  --  i.  e,  ,  that  an  optimal 
patroller's  strategy  would  cause  the  probability  of  detection 
to  be  independent  of  which  transit  lane  was  chosen  by  the 
transitor. 

We  have  already  considered  the  unilateral  incomplete 
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information  game,  in  which  the  patroller  knot's  detection- 
range  but  the  transitor  dees  not,  and  the  transitor  will  dis¬ 
cover  the  patroller's  strategy  in  the  long  run.  Because  the 
transitor  is  always  facing  an  equal  level  of  defense  in  each 
lane  for  either  of  the  stage  games,  it  is  not  surprising  that 
(i)  he  also  faces  an  equal  level  of  defense  in  each  lane  for  the 
games  of  incomplete  information  obtained  from  combination 
of  those  stage  games,  and  (ii)  the  value  of  those  combination 
games  is  close  to  the  value  which  would  be  expected  if  the 
stage  game  (in  the  above  examples,  the  detection- range) 
were  revealed  immediately  to  both  players  as  soon  as  the 
random  choice  was  made. 

We  will  now  proceed  with  examples  where  the  stage 
games  are  not  equal-gamma  games,  where  intuition  is  less 
useful  in  predicting  good  strategies  for  the  unilateral  game 
of  incomplete  information  and  where  the  better -informed 
player  might  be  able  to  reap  a  substantial  profit  from  his 
intelligence  advantage. 

In  all  these  examples  we  continue  to  suppose  that 

Player  II  must  select  a  lane  through  which  he  attempts  to 

transit  undetected,  and  that  Player  I  must  select  a  position 

i  in  which  to  patrol.  The  probability  of  detection  if  I  chooses 

i  and  II  chooses  j  is  g..  and  the  array  (g..)  makes  up 

ij 

the  matrix  G  .  The  several  alternative  stage -game  matri¬ 
ces  are  called  ,  G^  >  •  •  • »  GQ  • 
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Example  3.  If  there  is  a  "blind"  l«.ie  with  smaller 
detection -probabilities  than  elsewhere,  which  is  a  priori 
equally  likely  to  be  any  one  of  the  lanes;  and  if  the  expected 
detection -probabilities  in  each  lane  are  such  that  the  "ex¬ 
pected  game"  G  has  an  "equal-gamma"  solution,  then  it 
seems  intuitively  obvious  that  the  transitor  could  profit  by 
any  hint  (however  small)  as  to  which  lane  was  actually  the 
blind  one  --  consequently  we  would  expect  the  payoff  of  such 
a  G(q)  ,  as  a  function  c  !  q  to  be  a  concave  function.  We 
can  illustrate  this  with  a  small  example,  and  then  prove  it 
for  the  simple  special  case  in  which  a  transitor  can  only  be 
detected  by  a  patroller  in  the  same  lane  --  so  that  all  the 
matrices  Gj  ,  G^  ,  ...  ,  Gq  ere  diagonal. 


Take 


°l  *  3 


/  4  1  0  \ 


1 

\° 


1 

2 

1 


G2  =  4 


/  2 

1 


\ 


0  1 


0  \ 
1 
4 


and  G(x)  =  (l-x)G^  +  x  G2 
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Then,explicitly  ; 


S(x)  =  4 


1 4  -  2x 


1 

0 


1  0 

2  +  2x  1 
1  4 


strategy  for  Player  I  in  ?T(x)  is  optimal 


(8  +  4x  -  2X2)  *(2  +  4x,  4  -  3x,  2  +  3x  -  2x  )  ; 


optimal  strategy  for  II  is  the  same;  and  the  value  is 

f  12  +  9x  -  8x2 
^  32  +  I6x  -  8x^ 

This  io  tabulated  below  and  ««  sr.O"  a  in  Figure  3.  10  on  p..  103.. 
Note  that 

v{G(0))  =  v(G1)  =  |  =  0.3750  , 
v(G(l))  =  v(G2)  =  ^  =  0.3250 
v(G(^))  =  =  0.3816  . 

(Incidentally,  v(£T(^-))  is  not  only  greater  than  the  average 
between  v(G^)  and  v(G2)  *  hut  is  in  this  case  actually 
greater  than  eithe  r  of  them. } 
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X 

v(G(x)) 

0.0 

.3750 

0. 1 

.3825 

0.2 

.  3865 

0.3 

.  3875 

0.4 

.  3858 

0.5 

.3816 

0.6 

.  3750 

0.7 

.3661 

0.8 

.3548 

0.9 

.3412 

1.0 

.3250 

Value  of  the  Games  of  Exampl«3 

Now  the  value  of  a  diagonal-matrix  game  of  positive 
elements  g^,  . ..,  g  is  simply 

v(<3)  =  (s.g.."1)-1  . 

By  appealing  to  continuity  of  the  elementary  functions, 
it  suffices  to  prove  our  assertion  for  cases  where  only  one 
entry  (say  )  changes. 

- 1  n  -1-1 

Because  the  second  derivative  of  (g^  +  ) 
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with  respect  to  is  negative,  the  desired  result  follows. 

Example  4.  In  this  example,  we  take  a  pair  of  stage 
matrices  which  represent  different  detection-rar.ges,  but 
which  do  not  have  the  "triangular"  property  of  the  matrices 
in  Examples  1  and  2.  Specifically, 


It  is  easy  to  verify  that 


d 

Hx 


(-1)  .(-1) 

+  y ?  •* 


2 


=  ±  I  1 

dx  71  I 

\{1  +  xy) 

'  \ 

=  ,  -2  \  y  ,  which  is 

always  negative  for  positive  x  and  y  . 
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'  jJBftMM>«wi'W"i» ■i«*u*«**'y <•» 

I 


G0  = 


1 

7 


1 

3 

1 


1 

3  1 
1  3 
1 


1 

3 

1 


and  £r(x)  =  (1  -  x)  Gj  +  x  G^ 
Explicitly* 


^(x)  =  -i 


Since  we  are  again  dealing  with  a  symmetric  matrix,  we  can 
describe  the  strategies  for  each  player  by  a  triple  of  numbers: 
the  first  is  the  probability  of  playing  row  1  +  -j-  row  6)  ; 
the  second  is  the  probability  of  playing  (^row  2  +  row  5); 
and  the  third  is  the  probability  of  playing  row  3  +  ^  row  4). 

The  optimal  strategy  for  Player  I  in  the  game  G(x) 
is 
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Note  that 

v(?J(0))  =  vJGj)  =  £  =  0.1667  , 
v(S(l))  =  v(G2)  =  to  =  °*2544  • 
v^-j))  =  -jto  =  0.2119  . 

Values  of  v(£T(x))  for  Example  4  are  tabulated  below 
and  graphed  in  Fig.  3  .11.  Note  that  the  function  v((?(x)) 
is  again  concave,  so  that  again  it  would  be  prudent  for  the 
patroller  to  ignore  information  he  might  obtain  as  to  which 


of  these  would  be  the  true  detection-function. 
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